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We present and analyze protocols for fault-tolerant quantum computing using color codes. To 
process these codes, no qubit movement is necessary; nearest-neighbor gates in two spatial dimen- 
sions suffices. Our focus is on the color codes defined by the 4.8.8 semiregular lattice, as they 
provide the best error protection per physical qubit among color codes. We present circuit-level 
schemes for extracting the error syndrome of these codes fault-tolerantly. We further present an 
integer-program-based decoding algorithm for identifying the most likely error given the (possibly 
faulty) syndrome. We simulated our syndrome extraction and decoding algorithms against three 
physically-motivated noise models using Monte Carlo methods, and used the simulations to estimate 
the corresponding accuracy thresholds for fault-tolerant quantum error correction. We also used a 
self-avoiding walk analysis to lower-bound the accuracy threshold for two of these noise models. We 
present two methods for fault-tolerantly computing with these codes. In the first, many of the op- 
erations are transversal and therefore spatially local if two-dimensional arrays of qubits are stacked 
atop each other. In the second, code deformation techniques are used so that all quantum process- 
ing is spatially local in just two dimensions. In both cases, the accuracy threshold for computation 
is comparable to that for error correction. Our analysis demonstrates that color codes perform 
slightly better than Kitaev's surface codes when circuit details are ignored. When these details are 
considered, we estimate that color codes achieve a threshold of 0.082(3)%, which is higher than 
the threshold of 1.3 x 10 -5 achieved by concatenated coding schemes restricted to nearest-neighbor 
gates in two dimensions [Spedalieri and Roychowdhury, Quant. Inf. Comp. 9, 666 (2009)] but lower 
than the threshold of 0.75% to 1.1% reported for the Kitaev codes subject to the same restrictions 
[Raussendorf and Harrington, Phys. Rev. Lett. 98, 190504 (2007); Wang et al, Phys. Rev. A 83, 
020302(R) (2011)]. Finally, because the behavior of our decoder's performance for two of the noise 
models we consider maps onto an order-disorder phase transition in the three-body random-bond 
Ising model in 2D and the corresponding random-plaquette gauge model in 3D, our results also 
answer the Nishimori conjecture for these models in the negative: the statistical-mechanical classi- 
cal spin systems associated to the 4.8.8 color codes are counterintuitively more ordered at positive 
temperature than at zero temperature. 



I. INTRODUCTION 



The promise of fault-tolerant quantum computing is 
a crowning achievement of quantum information science 
[1-8] . Under a specific set of noise and control assump- 
tions, the promise is that any ideal quantum circuit of size 
L can be simulated to any desired precision e by a faulty 
quantum circuit whose size is at most 0(e~ 1 L\og a L) for 
some (small) constant a. Fault-tolerant quantum com- 
puting protocols are judged by the resources they employ 
in the course of a simulation. Examples of such resources 
include the constant a, the hidden constant in the big- 
O notation, and the requirements imposed by the noise 
and control assumptions. Often protocols are compared 
by a requirement encapsulated in a single number, the 
accuracy threshold, which is an upper bound on the error 
probability per elementary operation that a faulty cir- 
cuit must satisfy for the protocol to work. A variety of 
fault-tolerant quantum computing protocols have been 
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developed, with threshold estimates ranging from as low 
as 10~ 6 [9] to as high as 3% [10-12], depending on the 
protocol and the noise and control assumptions. 

An important control constraint relevant for several 
quantum computing technologies is that the only multi- 
qubit gates that are possible are those between nearest- 
neighbor qubits, where the qubits are laid out in some 
2D geometry in which each qubit neighbors a constant 
number of other qubits. Fault-tolerant quantum com- 
puting protocols based on concatenated quantum error- 
correcting codes have a fractal structure that is not 
commensurate with such a geometry. Indeed, forcing 
such codes into a semiregular 2D geometry requires that 
one introduce a substantial number of additional qubit- 
movement operations that expose the protocol to more 
errors, thereby diminishing its accuracy threshold. The 
largest accuracy threshold of which we are aware for a 
concatenated-coding protocol in a semiregular 2D geom- 
etry is 1.3 x 10 -5 [13]; that protocol is based on the 
concatenated nine-qubit Bacon-Shor code [14] embedded 
in the 2D square lattice. 

Cognizant of the constraints imposed by 2D geom- 
etry, Kitaev introduced a family of quantum error- 
correcting codes called surface codes that require only 
local quantum processing, where locality is denned by 
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a graph embedded in a surface [15]. Several fault- 
tolerant quantum computing protocols have been devel- 
oped around surface codes [16-18], and these protocols 
have significantly higher accuracy thresholds than their 
concatenated-coding counterparts. Numerical threshold 
estimates for surface-code protocols range from 0.75% to 
1.1% [17-19]; an analytic proof in Ref. [16] guarantees 
that it is no less than 1.7 x 10~ 4 . 

Recently Bombin and Martin-Delgado proposed a new 
family of quantum error-correcting codes they call color 
codes which are also defined to be local relative to a graph 
embedded in a surface [20] . Specifically, they are defined 
by face-three-colorable trivalent graphs in the following 
way: on each vertex of the graph lies a qubit, and for each 
face / of the graph, one defines two "stabilizer genera- 
tors" or "checks," Xf and Zf. Xf is the tensor product 
of Pauli X operators on each qubit incident on face /, 
while Zf is the tensor product of Pauli Z operators on 
each qubit incident on face /. The color code's codespace 
is defined as the simultaneous +1 eigenspace of each of 
the check operators. 

A fault-tolerant quantum computing protocol based 
on color codes requires an infinite family of color codes 
of increasing size in order to be able to simulate arbi- 
trarily large ideal quantum circuits to increasing preci- 
sion. A natural source for an infinite color-code family 
is a uniform tiling of the plane by a trivalent face-three- 
colorable lattice. Such a lattice can be embedded in any 
orientable surface, although later we will restrict atten- 
tion to embeddings in planar discs. These "semiregular" 
or "Archimedean" lattices are described in vertex nota- 
tion as r.s.t, where each vertex is locally surrounded by 
an r-gon, an s-gon, and a i-gon. The only possible triva- 
lent face-three-colorable tilings of the plane are the 4.8.8 
lattice, the 6.6.6 (hex) lattice, and the 4.6.12 lattice, de- 
picted in Fig. 1 [21]. 

(a) 4.8.8 (b) 6.6.6 (c) 4.6.12 

FIG. 1: The three possible face-three-colorable trivalent uni- 
form tilings of the plane. 

Accuracy thresholds for fault-tolerant quantum com- 
puting have been estimated for color codes in several 
highly idealized noise models numerically. The values of 
these thresholds are summarized in Table I, along with 
analogous estimates for a well-studied surface code and 
two recently-proposed topological subsystem codes. This 
table contains numerous gaps, some of which we fill in 
with the results of this Article — the entries containing our 
results are highlighted in bold. The most significant gap, 
which we fill, is an estimate of the accuracy threshold 



for noise that afflicts the individual quantum circuit ele- 
ments used in a fault-tolerant color-code-based quantum 
computing protocol. The accuracy threshold for noise af- 
flicting the circuit model is perhaps the most instructive 
of all table entries. This is because this threshold es- 
tablishes the target error rate per elementary operation 
that a quantum technology must meet to admit fault- 
tolerant quantum computation using these codes. It also 
allows for a fair "apples-to-apples" comparison to the 
high thresholds estimated for Kitaev's surface codes in 
the circuit model. 

In this Article, we analyze the accuracy threshold of 
the 4.8.8 color codes for fault-tolerant quantum compu- 
tation under several noise and control models. We have 
restricted our analysis to protocols which use the decoder 
that identifies the most likely error (MLE) given the er- 
ror syndrome. We formulate the MLE decoder as an in- 
teger program (IP), which in general is NP-hard to solve 
[33]. Although the decoder is inefficient, it establishes 
a threshold that we expect is close the the maximum 
threshold possible for these codes, namely the one ob- 
tainable by an optimal decoder, which identifies the most 
likely logical operation given the error syndrome. For 
small codes, the MLE IP can be solved "offline" ahead of 
time to generate a lookup table that can be used during 
the course of a "live" fault-tolerant quantum comput- 
ing protocol. Our results comprise both numerical esti- 
mates of the accuracy threshold achieved via Monte Carlo 
simulations and a rigorous lower bound on the accuracy 
threshold that we prove using combinatorial counting ar- 
guments. 

The remainder of this Article is organized as follows. In 
Sec. II, we lay out the control model and the three noise 
models we consider. In Sec. Ill, we summarize the prop- 
erties of the 4.8.8 triangular color codes we study, present 
two circuit schedules for extracting the error syndrome in 
these codes, and formulate MLE decoders for these codes 
as integer programs for each of the noise models that we 
consider. In Sec. IV, we report our numerical estimates 
for the accuracy threshold for fault-tolerant quantum er- 
ror correction of these codes for each of the noise models 
that we consider. In Sec. V, we use a self-avoiding-walk 
analysis to prove rigorous lower bounds for the accuracy 
thresholds of fault-tolerant quantum error correction of 
these codes against two of the noise models that we con- 
sider. In Sec. VI, we relate the quantum error correction 
accuracy threshold to the quantum computation accu- 
racy threshold for two scenarios: one in which logical 
qubits are associated with 2D planes that are stacked 
atop one another like pancakes and the other in which 
logical qubits are associated with "defects" in a single 
2D substrate. In Sec. VII we conclude, summarizing and 
interpreting our results both in terms of the accuracy 
thresholds we report and in terms of their consequences 
for "re-entrant behavior" of an order-disorder phase tran- 
sition in two associated classical statistical-mechanical 
models. We cap off our conclusions with some parting 
thoughts about future directions that we believe are wor- 
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"Reference computes threshold against DP channel, not BP chan- 
nel. For non-circuit-based noise models, the decoder used does not 
account for correlations between bit flips and phase flips in DP 
channel. In these models, we reported the result for the equivalent 
effective BP channel of strength \y. 

Decoder based on hypergraph matching heuristic. 

Timited numerics only weakly suggest this value. 

^Decoder based on mapping to two Kitaev codes. 

TABLE I: Numerically-estimated accuracy thresholds for several topological quantum error-correcting codes, noise models, 
and decoding algorithms. The first three codes (4.8.8, 6.6.6, 4.6.12) are the color codes described in Fig. 1 and its preceding 
text. The last three codes are the Kitaev surface code on the square lattice [15], a topological subsystem color code on the 
3.4.6.4 lattice [32], and a hypergraph-based topological subsystem code proposed by Suchara, Bravyi, and Terhal [31]. The 
details of the noise models (code capacity, phenomenological, and circuit-based) and decoders (MLE, optimal, and other) are 
discussed in the text; when possible, results from other references have been translated into one of these models. The notation 
"x.yx ■ ■ ■ yk(z)%" means x.yi ■ ■ ■ yt% ± (z x 10~ fc )%. When such notation is not used, it means that the no error analysis was 
reported in the reference from which the value was drawn. 



thy of study. 

II. NOISE AND CONTROL MODEL 

The performance of a fault-tolerant quantum comput- 
ing (FTQC) protocol is strongly influenced by underlying 
architectural assumptions, so it is important to clearly 
list what they are. Indeed, when those assumptions are 
not borne out in real quantum information technologies, 
an FTQC protocol may fail entirely [34, 35] . 

Every existing FTQC protocol makes the following ar- 
chitectural assumptions — assumptions which appear to 
be necessary: 

1. Nonincreasing error rate. The asymptotic scal- 
ing of the error rate as a function of the circuit's 
size is nonincreasing. This allows the performance 
of fault-tolerant circuits to increase asymptotically. 

2. Parallel operation. The asymptotic parallel- 
processing rate is larger than a constant times the 
asymptotic error rate. This allows error correction 
to keep ahead of the errors. 

3. Reusable memory. The asymptotic rate at 
which one can erase or replace qubits is larger than 
a constant times the asymptotic error rate. This al- 
lows entropy to be flushed from the computer faster 
than it is generated by errors. 



Some FTQC protocols also make the following archi- 
tectural assumptions, which generally lead to higher ac- 
curacy thresholds; we make these assumptions here: 

4. Reliable classical computation. Classical com- 
putations always return the correct result. 

5. Fast classical computation. Classical computa- 
tions are instantaneous. 

6. No qubit leakage. Qubits never "leak" out of the 
computational Hilbert space. 

7. Uncorrelated noise. Each qubit and gate is af- 
flicted by an independent noise source. 

Some additional architectural assumptions, which have 
a less clear impact on the accuracy threshold, are fre- 
quently made as well; we also make these assumptions: 

8. Standard gate basis. The set of (faulty) quan- 
tum gates (including preparation and measure- 
ment) available consists of |0), |+), I, X, Z, T, 
S, CNOT, M z , and M X - The definition of what 
these gates are can be found in standard textbooks, 
e.g., in Refs. [36, 37]. 

9. Equal-time gates. Each gate, including prepara- 
tions and measurements, takes the same amount of 
time to complete. 
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10. Uniformly faulty gates. Each fc-qubit gate, 
including preparations and measurements, is as 
equally as faulty as every other fc-qubit gate. 

Inspired by the limitations of 2D geometry for some 
quantum computing technologies, we also make the fol- 
lowing assumptions: 

11. 2D layout. Qubits are laid out on a struc- 
ture describable by a graph embedded in a two- 
dimensional surface. 

12. Local quantum processing. Gates can only cou- 
ple nearest-neighbor qubits in the graph describing 
their layout. 

Finally, we make the following three variants of a thir- 
teenth assumption about the noise model afflicting each 
gate. Of all the assumptions we make, we have found 
that this one is most likely to vary in the literature. 
Commonly-studied alternatives for this assumption in- 
clude stochastic adversarial noise [38-41], purely depo- 
larizing noise [42], and noise that has a strong bias, such 
as having phase flips significantly more probable than bit 
flips [43]. 

13(a). Circuit-level noise. Each faulty single-qubit 
preparation and faulty coherent single-qubit gate 
(|0), |+), I, X, Z, H, T, S) acts ideally, followed 
by the bit-flip channel of strength p, which applies 
bit flips (Pauli X operators) with probability p, 
followed by the phase-flip channel of strength p, 
which applies phase flips (Pauli Z operators) with 
probability p. We call this channel the BP chan- 
nel. Each faulty single-qubit measurement (Mx, 
Mz) acts as the BP channel of probability p fol- 
lowed by a measurement that returns the incorrect 
result with probability p. Importantly, this noise 
model assumes that the state after a measurement 
is in an eigenstate of the observable measured, just 
perhaps not the eigenstate that the measurement 
indicates. Each CNOT gate acts ideally followed 
by a channel in which each of the 16 two-factor 
Pauli products (II, IX, XI, XY, etc.) is ap- 
plied with probability p/16. We call this chan- 
nel the DP channel. This model differs slightly 
from a frequently-studied variant in the literature 
in which each of the 15 nontrivial two- factor Pauli 
products is applied with probability p/15 and the 
identity is applied with probability 1 — p. 

13(6). Phenomenological noise. This noise model is 
the same as the circuit-level noise model (13(a)), 
except that the circuit for syndrome extrac- 
tion (described later) is modeled "phenomenolog- 
ically," having a probability p for returning the 
wrong syndrome bit value. In this model, the 
propagation of errors between data qubits and 
between data and ancilla qubits induced by the 
syndrome extraction circuit are ignored. Single- 
qubit and two-qubit gates on the data qubits in 



circuits other than those used for syndrome ex- 
traction (e.g., for encoded computation) are still 
subject to the BP and DP channels, respectively, 
as before. 

13(c). Code capacity noise. This model is the same 
as the phenomenological noise model, except that 
the syndrome-bit error rate is assumed to be zero. 
Because there is no need to repeat syndrome mea- 
surements in this model, and because the accuracy 
threshold for "defect-braided" quantum computa- 
tion is the same as that for quantum memory (as 
argued later), the accuracy threshold for this noise 
model is the same as what in quantum informa- 
tion theory is called the single-shot, single-letter 
quantum capacity for color codes subject to the 
BP channel. 



III. FAULT-TOLERANT ERROR CORRECTION 
OF COLOR CODES 

A. Code family 

We confine our analysis of color codes to the 4.8.8 color 
codes; our choice is motivated by two factors. First, of 
the three color codes on semiregular 2D lattices, the 4.8.8 
code uses the fewest qubits per code distance. Second, 
the 4.8.8 code is the only one of the three which can 
realize encoded versions of the entire "Clifford group" 
[36] of quantum gates, namely the gates which conjugate 
Pauli operators to Pauli operators in the Heisenberg pic- 
ture, in a transversal fashion [20], i.e., by applying the 
same operation to every qubit in a code block or be- 
tween corresponding qubits in two code blocks. In par- 
ticular, the gates X, Z, H, S, and CNOT have transver- 
sal encoded implementations for these codes. When en- 
coded gates are implemented transversally, fault-tolerant 
quantum computing protocols for simulating these gates 
are generally simpler, leading to more favorable accuracy 
thresholds. The Clifford group of gates is an important 
group of gates for stabilizer codes such as the color codes, 
since error correction can be carried out solely using those 
gates [44]. 

We further restrict our analysis to planar color codes, 
namely those which are embedded in the disc (a sphere 
with one puncture). We do this because, for all quantum- 
computing technologies of which we are aware, arranging 
qubits on a flat disc is more plausible than arranging 
them on a more general surface like a torus. The graph 
constraints defining color codes require that planar color 
codes have a boundary shaped like a polygon having 3m 
sides for some positive integer m. A 3?n.-sided planar 
color code encodes m logical qubits; we restrict attention 
to the simplest case in which m = 1. In other words, our 
focus on this paper is on triangular color codes. Exam- 
ples of three different triangular color codes are depicted 
in Fig. 2. 
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The code distance of a triangular color code is equal to 
its side length, namely the number of qubits along a side 
of the defining triangle. To see this, notice that the logi- 
cal X and Z operators for the logical qubit are transver- 
sal because they are encoded Clifford gates. Thus, when 
one multiplies a logical X or Z operator by all checks 
of the same Pauli type, except the checks incident on a 
specified side, one obtains an equivalent logical opera- 
tor whose Pauli-weight is equal to the that side's length. 
The family of 4.8.8 triangular codes we study is generated 
according to the pattern depicted in Fig. 3. Note that 
the smallest triangular code (for any of three triangular 
code families depicted in Fig. 2) is equivalent to the well- 
known Steane [7,1,3] code [45]; triangular codes offer a 
way to generate an infinite code family from the Steane 
code by a means other than concatenation [72] . 

(a) 4.8.8 code (b) 6.6.6 code (c) 4.6.12 code 

FIG. 2: Three distance d = 11 triangular codes encoding 
one qubit, drawn from the 4.8.8, 6.6.6, and 4.6.12 lattices 
respectively. For general d, these codes have length n equal 
to \d 2 + d - 1, \d 2 + \ and §d 2 - 3d + § respectively. The 
asymptotic ratio of d 2 to n is highest for the 4.8.8 codes. 




(a) d = 3 (b) d = 5 (c) d = 7 



FIG. 3: 4.8.8 color codes of sizes 3, 5, and 7. 

Although the colors of the faces in a color code have 
no intrinsic meaning for the algebraic structure of the 
code other than constraining the class of graphs on which 
color codes are defined, it is useful to use the colors as 
placeholders in discussions from time to time. To that 
end, we will refer to the colors of the faces as "red," 
"green," and "blue." We will further assign a color to 
each edge so that an edge's color is complementary to 
the colors of the two faces upon which it is incident. We 
will call a set of vertices lying on a collection of edges of 
the same color connected by faces also having that color a 
"colored chain;" an example of a colored chain is depicted 
in Fig. 4. We will assign colors to each side of a triangular- 
code so that the color of the side is complementary to the 
colors of the faces terminating on that side; for example, 
in Figs. 2 and 3, the left sides of the triangles are blue, 
the right sides are green, and the bottoms are red. These 



side colors are indicated explicitly in Fig. 4. 




FIG. 4: A green-colored chain in a triangular code. The chain 
connects a green-colored side of the 4.8.8 triangular code to a 
green octagonal face. If qubits are flipped (are in error) along 
this chain, it will only be detected by this terminal octagonal 
check operator. 



B. Syndrome extraction 

To record each error-syndrome bit, the relevant data 
qubits interact with one or more ancilla qubits and the 
ancilla qubits are then measured. Shor [1], Steane [46], 
and Knill [10] have devised elaborate methods for ex- 
tracting an error syndrome to minimize the impact of 
ancilla-qubit errors spreading to the data qubits. For 
topological codes, however, such elaborate schemes are 
not necessary; a single ancilla qubit per syndrome bit 
suffices. This is because, by choosing an appropriate or- 
der in which data qubits interact with the ancilla qubit, 
the locality properties of the code will limit propagation 
of errors to a constant-distance spread. Using more elab- 
orate ancillas is possible, and in general there is a trade- 
off in the resulting accuracy threshold one must examine 
between the reduction in error propagation complexity 
offered versus the additional verification procedures re- 
quired. Here, we examine the simplest case, with one 
ancilla qubit per syndrome bit. By placing two syn- 
drome qubits at the center of each face / (one for the 
Xf measurement and one for the Zf measurement), the 
syndrome extraction process can be made spatially local, 
in keeping with the spirit of the semiregular 2D geometry 
constraints we are imposing. 

Because color codes are Calderbank-Shor-Steane 
(CSS) codes [47, 48], syndrome bits can be separated 
into those which identify Z errors (phase flips) and those 
which identify X errors (bit flips). These correspond to 
the bits coming from measuring the Xt and Zf opera- 
tors respectively. The circuit for measuring an operator 
Xf is identical to the one for measuring the operator Zf, 
except with the basis conjugated by a Hadamard gate; 
examples of bit-flip and phase-flip extraction circuits for 
the square faces in the 4.8.8 color code are depicted in 
Fig. 5. 

In a full round of syndrome extraction, both Xf and Z f 
must be measured for each face /. One way of schedul- 
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FIG. 5: Six-step circuits for measuring X® 4 and Z® 4 



ing this is to perform all Xf measurements in parallel 
followed by all Zf measurements in parallel. The mini- 
mal number of steps (ignoring preparation and measure- 
ment) for parallel Xf measurements is eight; an example 
of such a schedule is depicted in Fig. 6. The Zf mea- 
surements can be carried out by the same schedule, but 
in the Hadamard-conjugated basis as depicted in Fig. 5. 
A complete syndrome extraction round using this sched- 
ule then takes 20 steps: 10 for the Xf measurement and 
10 for the Zf measurement. For this schedule, one only 
needs to have one, not two, syndrome qubits at the center 
of each face. 




FIG. 6: Simple syndrome extraction circuit schedule. A round 
of X checks is followed by a round of Z checks. The number 
at each vertex corresponds to the discrete time step in which 
the physical qubit at that vertex interacts with the syndrome 
qubit at the face's center via a CNOT gate. The same sched- 
ule is used for both X and Z checks, but with the direction 
of the CNOT gates reversed. 

The circuit for a full syndrome extraction round can be 
optimized to use fewer time steps when both syndrome 
qubits in a face can be processed in parallel. An exam- 
ple of an "interleaved" schedule that uses ten steps is 
depicted in Fig. 7. 

We calculate estimates for the accuracy threshold for 
both schedules, to assess the impact of compressing 
the schedule. Some authors who have reported im- 
proved thresholds for concatenated-coding schemes us- 
ing Bacon-Shor codes attribute the improvement in large 
part to the simplicity of the fault-tolerant Bacon-Shor- 




1,5 2,6 



7,3 8/ 
1,5 2,6 

3,7 4,8 
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FIG. 7: Schedule with X and Z syndromes measured concur- 
rently, in "interleaved" fashion. This schedule takes 8 steps, 
plus an extra step for syndrome qubit preparations, plus an 
extra step for syndrome qubit measurements. The label m, n 
at a vertex indicates that at time step m the qubit at that 
vertex interacts with the X-syndrome qubit via a CNOT gate 
and at time step n the qubit at that vertex interacts with the 
Z-syndrome qubit via a CNOT gate. 



code syndrome-extraction circuit [39]. For color codes, 
a priori, it is not clear that using a simpler syndrome- 
extraction circuit will yield an analogous improvement. 
This is because these circuits are not constructed using 
any fault-tolerant design principles — catastrophic error 
propagation is halted by the codes' structure, not by 
circuit-design principles. It may be the case, in fact, that 
a simpler circuit will allow errors to propagate to a larger 
set of qubits than a less simple one. The set of errors to 
which individual errors are propagated by a syndrome- 
extraction circuit are called "hooks" in Ref. [16]. An ex- 
ample of how an error can propagate to a "hook" using 
the schedule of Fig. 7 is depicted in Fig. 8. 

Neither the 20-step nor the 10-step schedule is neces- 
sarily optimal in the sense of yielding the highest thresh- 
old for a fixed number of time steps; we leave that op- 
timization to others. Indeed any schedule that satisfies 
two constraints is valid: (1) no qubit can be acted upon 
by two gates at the same time and (2) any stabilizer 
generator for an error-free input state (including ancilla 
syndrome qubits) must propagate to an element of the 
stabilizer group for an error-free output state. Satisfy- 
ing this second criterion is not trivial; for example, an 
"obvious" schedule that acts on each face in a clockwise 
fashion in a manner obeying constraint (1) will not sat- 
isfy constraint (2). 

The number of steps in the syndrome extraction round 
can be reduced further to eight steps if we prepare the 
ancillas for the octagon measurements not in single-qubit 
states but in cat-states (\0) m + \l) m )/^/2 and use Shor's 
method of syndrome extraction [1]. (One can also use 
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FIG. 8: A single X error that occurs between time steps five 
and six on the syndrome qubit for measuring X® s , indicated 
by the small red circle, will propagate to other X errors ac- 
cording to the arrows. Note that an even number of X flips 
is equivalent to no flip at all. Errors that propagate to other 
syndrome qubits will not propagate further because the syn- 
drome qubits are refreshed before each syndrome extraction 
round. This particular error causes three data qubits to flip. 
These flips are correctly detected by the yellow-colored syn- 
drome bits. 



four-qubit cat states and create an eight-step schedule, 
as demonstrated in Ref. [49].) Eight steps is the absolute 
minimum possible for syndrome extraction, since each 
qubit must be checked by six different syndrome bits, 
which must also be prepared and measured. While using 
cat states reduces the circuit depth, the cat states need 
to be verified. We opted not to study this schedule be- 
cause the verification is stochastic, which would lead to a 
difficult synchronization problem for a large-sized code. 
That said, such a schedule has the potential to offer a 
larger accuracy threshold. 

Because there is an inherent asymmetry in the order in 
which we choose to perform Xf and Zf measurements, 
we will report two threshold results, one for the Xf mea- 
surements and one for the Zf measurements. When we 
only report one value, we are reporting the lower of the 
two threshold values. For the phenomenological noise 
model, we choose to model the Xf and Zf syndrome ex- 
traction processes as occurring synchronously rather than 
one followed by the other, since so many of the details of 
the circuit are washed away in the model anyway. This 
has the advantage of enabling the accuracy threshold in 
the phenomenological model to be identified with a phase 
transition in an associated random-bond Ising model, as 
described in Ref. [23] . We will discuss this connection in 
more detail in Sees. IV A and VII B. 

Finally, it is worth reminding that the entire syndrome 
extraction round is repeated a number of times equal to 
the distance of the code when measurements are allowed 



to be faulty, such as in the circuit-level and phenomeno- 
logical noise models that we study. This ensures that 
errors in the syndrome bit values can be suppressed as 
well as errors in the data qubits can be suppressed. 



C. Decoding algorithm 

The process of decoding refers to a classical algorithm 
for identifying a recovery operation given an error syn- 
drome, regardless of whether the code from which the 
syndrome was derived is classical or quantum. Impor- 
tantly, decoding does not refer to "unencoding," or per- 
forming the inverse of encoding. For classical linear 
codes, the optimal decoding algorithm is the Most Likely 
Error (MLE) algorithm, which identifies the recovery op- 
eration to be the most likely pattern of bit-flip errors 
given the syndrome. In general, this algorithm is NP- 
hard [33] , but there are many families of codes for which 
the algorithm is known to be efficient. 

For quantum stabilizer codes, MLE decoding identi- 
fies the recovery operation to be the most likely n-qubit 
Pauli-group error given the syndrome. (The process of 
extracting the syndrome forces every error to "collapse" 
onto a definite n-qubit Pauli-group operator, which is 
why it is sufficient to restrict to this family of opera- 
tors.) MLE decoding is not necessarily optimal for quan- 
tum stabilizer codes. This is because quantum error- 
correcting codes can be degenerate, meaning that two 
distinct correctable errors can map to the same error syn- 
drome. Color codes are examples of highly degenerate 
codes. The optimal decoding algorithm for quantum sta- 
bilizer codes instead identifies the recovery operation to 
be one that causes the most likely logical operator to be 
applied after recovery. This is akin to a doctor prescrib- 
ing medicine that is most likely to cure the ailment rather 
than prescribing medicine that cures the most likely ail- 
ment. 

Once a decoding algorithm has identified a recovery 
operation, which is some n-qubit Pauli-group operator, 
it need not necessarily be applied. Because the process 
of applying the recovery operation is subject to faults, 
it is wiser to wait until the end of the computation and 
apply the net recovery operation rather than apply it 
after each decoding step. One can even propagate the 
correction past the final qubit measurements at the end 
of the quantum computation, where the recovery oper- 
ation becomes completely classical and fault-free. The 
catch is that one must (classically) adaptively update 
one's "Pauli frame" after each decoding iteration by per- 
muting the interpretation of the Pauli operators X, Y, 
and Z on each qubit as suggested by the recovery oper- 
ation. (The Pauli operators get conjugated by the Pauli 
error identified by the decoder.) 

For fault-tolerant quantum error correction and a num- 
ber of interesting encoded quantum circuits, only Clif- 
ford gates are required. Since Clifford gates propagate 
Pauli operators to Pauli operators in the Heisenberg pic- 



ture, one can efficiently track the changing Pauli frame 
through these gates, as guaranteed by the Gottesman- 
Knill theorem [50] . One can safely defer applying recov- 
ery operations until after final measurement in each of 
these circuits. However, for universal quantum compu- 
tation, at least one non-Clifford gate is required. In our 
protocols, the only such gate we use is the classically- 
controlled S' gate, depicted later in the circuit of Fig. 24. 
Because this gate propagates a Pauli error to a sum of 
Pauli errors, it is necessary to actually apply the recovery 
operation before all but a constant number of these gates 
in order to prevent the number of terms required to track 
one's "Heisenberg frame" from growing exponentially. 

We develop MLE decoders for triangular 4.8.8 color 
codes for the three noise model settings we study: code 
capacity, phenomenological, and circuit-based. For the 
code-capacity and phenomenological settings, the only 
operations are single-qubit measurements and identity 
gates. This means that they involve no circuitry that 
could map X errors to Z errors or vice-versa. Because of 
this, and because our noise model is one in which single- 
qubit operations are subject to BP channel noise (which 
applies X errors and Z errors independently), decoding 
can factor into bit-flip decoding and phase-flip decoding 
separately. Because color codes are also "strong" CSS 
codes [37], the MLE decoders for bit-flip and phase-flip 
errors are in fact identical; for concreteness, we formulate 
the decoder for Zf syndrome bits here. 



1. Code capacity MLE decoder 

In the code-capacity setting, we have a single error- 
free m-bit syndrome s = (si,...,s m ) T where s/ = 
when Zf is measured to have eigenvalue +1 and Sf — 1 
when Zf is measured to have eigenvalue —1. (The value 
of m is a function of the code size; for the triangular 
n-qubit distance-e? 4.8.8 color code, m = (d + l) 2 /4 — 1 
and n = (d+ l) 2 /2 — 1.) We assign a binary variable x v 
to each vertex v indicating whether or not the recovery 
operation calls for the qubit at vertex v to be bit-flipped 
(have Pauli X applied). The objective of MLE decod- 
ing is to minimize the number of x v variables that are 
assigned the value 1 subject to the constraint that the 
parity of the x v variables on each face is consistent with 
the observed syndrome. This can be expressed as the 
following mathematical optimization problem: 

min ~^^x v (1) 

V 

sto (£)x v = s f V/ (2) 

vef 

i„eB:={0,l}. (3) 

This optimization problem can be expressed as a linear 
binary integer program (IP) over the finite field GF(2) 



as follows: 

min l T x (4) 

sto -ffx = s mod 2 (5) 

xeB", (6) 

where 1 denotes the all-ones vector and H is the parity 
check matrix associated with the Z^-checks. (For color 
codes, this is the face- vertex incidence matrix.) 

To take advantage of well-developed numerical opti- 
mization software, it is helpful to replace the linear alge- 
bra over GF(2) in this mathematical program with linear 
algebra over E,. One way to do this is to introduce "slack 
variables" into the optimization problem. Because each 
check operator in the code has Pauli weight four or Pauli 
weight eight, each row of H has Hamming weight four or 
Hamming weight eight. This means that the /th compo- 
nent of the vector on the left hand side of constraint (5) is 
a sum of four or eight binary x v variables that must equal 
s f modulo 2. The modulo 2 restriction can be dropped by 
replacing s by s + 2zi +4z2 + 8Z3 in the constraint, where 
the Zj are binary "slack variable" vectors that allow the 
LHS to sum to any integer from ... 15. While there 
can be many degenerate solutions to this revised opti- 
mization problem having different Zj values, any solution 
generates the same optimal x as before. By combining 
the Zi variables and the x variables into a single vector 
y = (x T , z^f, z^, z^) T , the slack-variable version of the 
program becomes the following linear binary integer pro- 
gram in which the variables are restricted to be binary 
but in which the linear algebra is over R: 

min c T y (7) 

sto Ay = s (8) 
y € B n , (9) 

where c is a vector containing n ones followed by 3m zeros 
and A is the matrix generated by adjoining matrices to 
H as 

A:=(h\ -21 1 -47 1 -8/) , (10) 

in which each / denotes the m x m identity matrix. 

There are a number of symmetries that color codes 
possess which allow one to significantly reduce the com- 
plexity of this binary IP. For example, if y satisfies the 
constraints of the IP, then so does y with any number 
of faces complemented. Since complementing the face of 
any optimal solution will not reduce its weight, we know 
that each face's sum will never be more than half the 
weight of that face. This means that for any particular 
instance of the IP specified by the syndrome vector s, the 
sums for the octagon and square faces can only take the 
syndrome-dependent values listed in Table II, thereby re- 
ducing the number of slack variables required. We take 
advantage of these kind of symmetries in the software we 
developed code for estimating the code capacity of 4.8.8 
triangular color codes. For example, we never need to 
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Octagon 


Square 


s = 


0, 2, 4 


0, 2 


s = 1 


1, 3 


1 



TABLE II: Possible values octagonal and square face check 
sums can take for an optimal IP solution if the face check 
sum's parity s is fixed. 



use three slack variables and some times we need none at 
all. 

Maximum likelihood decoding is generally an NP-hard 
problem, and the color codes do not appear to fall into 
an "easy" subset of instances. This is unfortunate be- 
cause their close cousins, the surface codes, do have ef- 
ficient MLE decoders that can be solved as a minimum- 
weight perfect matching problem [16]. Nevertheless, we 
can solve the associated IP for reasonably small instance 
sizes. 



2. Phenomenological noise MLE decoder 

In the phenomenological noise model, the syndrome 
values themselves can be faulty so we repeat the syn- 
drome extraction process a number of times equal to the 
distance of the code. In this setting, it is the difference 
in syndrome bit values from one time step to the next 
rather than the absolute values at particular times step 
that indicate data errors. This is because a single data 
error at one time step will lead to flipped syndrome bits 
for all future time steps (assuming that the syndrome ex- 
traction is not faulty), and such a syndrome-bit history 
should not imply that data errors occurred at each time 
step — it should imply that a data error occurred only at 
the time step when the syndrome bit first changed its 
value. The difference in persistence between data and 
syndrome errors is depicted in Fig. 9. The input to a 
MLE decoder is therefore the collection of syndrome dif- 
ference vectors for all time steps, namely 

As t = s t - s t -i = (Si + s t _i) mod 2 Vi, (11) 

where s := 0. 

For a distance d color code, the optimization problem 
to solve is again to minimize the number of errors given 
the observed syndrome, except we now have d time steps' 
worth of data-error vectors, xi, . . . , x<j, and d time steps' 
worth of syndrome-error vectors, ri, . . . , r<j, as variables 
in the optimization problem. Mathematically, we can 
write the optimization problem as 



fin £l' J 



(12) 



sto (Hx t + r t + r t _i) mod 2 = As t mod 2 Vt (13) 
x <= IB™. (14) 

As we did for the code-capacity scenario, we can collect 
these constraints into a single constraint and add slack 




(a) Measurement Error. 



(b) Data Error. 



FIG. 9: If syndrome qubits are also allowed to be in error, we 
repeat syndrome measurements. Time advances from bottom 
to top. Yellow circles indicate syndrome bits with the value 
1. Solid yellow circles indicate bit-flip errors. 



variables to make the problem a linear binary IP over 
the reals. Because the left-hand side of the constraints 
in Eq. (13) can sum to up to ten for octagon constraints 
and up to six for square constraints, three slack variables 
again suffice, allowing us to formulate the optimization 
problem as 



T 

mm c y 



sto Ay = As 

y eB", 



(15) 
(16) 
(17) 



where c is a vector containing [n + m)d ones followed by 
3md zeros, As is the vector (Asf, . . . , AsJ) T , y is the 
vector (xj 1 , . . . , 
matrix 



T 

n > 



rj, zj, z£ , and A is the 



(H 



A = 



H 



I 

I I 



H 



I I 



-21 



-41 -81 



(18) 



Finally, as we did for the code capacity setting, we can 
use symmetries to reduce the complexity of solving this 
IP; Table III summarizes what the possible values are for 
the square-faced and octagonal-faced constraints. 





Octagon 


Square 


s = 


0, 2, 4, 6 


0, 2, 4 


s = 1 


1, 3, 5 


1, 3 



TABLE III: Possible values octagonal and square face check 
sums can take for an optimal IP solution if the face check 
sum's parity s is fixed. 
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3. Circuit-level decoder 

In the circuit-level noise model, each component of the 
syndrome extraction circuit can fail with a probability 
that is a function of a parameter p, so that the over- 
all probability of a syndrome bit being in error, p s , is 
a complicated function of p. Even more dauntingly, the 
circuits can induce correlated errors between syndrome 
bits and between syndrome bits and data qubits. The 
phenomenological-noise model does not capture these 
noise correlations. 

We developed an MLE decoder for the circuit-level 
noise model that accounts for both these induced error 
correlations and the fact that in this noise model, single- 
qubit operations are subject to BP-channel noise while 
CNOT gates are subject to DP-channel noise. However, 
this decoder uses exponentially many more constraints 
than the phenomenological decoder as a function of code 
size. Because the IP decoder is already NP-hard, we 
opted not to study this truly MLE decoder but rather 
use the phenomenological-noise MLE decoder, which ig- 
nores these subtleties. Taking correlations into account 
will likely boost the accuracy threshold, but probably not 
by large factors [51]. By way of comparison, the thresh- 
old for the square-lattice surface code in the circuit-level 
noise model is 0.68% when the phenomenological decoder 
is used [52] (0.75% [17] when using a non-MLE decoder 
that takes into account some entropic effects), a thresh- 
old value that has recently been boosted to 1.1% [19] by 
accounting for some of the correlations in the noise. We 
leave the refinement of true MLE decoding of this noise 
model to others. 



IV. NUMERICAL ESTIMATE OF THE 

ACCURACY THRESHOLD FOR 
FAULT-TOLERANT QUANTUM ERROR 
CORRECTION 

A. Code capacity noise model 

Because the [n, l,d] triangular 4.8.8 color codes are 
CSS codes, when they are subject to BP-channel noise 
of strength p, their code capacity is the same as their 
bit-flip or phase-flip capacity; we focus on the bit-flip 
capacity here for definiteness. The number of distinct 
bit-flip syndromes is 2^- n ~ 1 ^ 2 and the number of distinct 
bit-flip errors is 2™. For small n, one can pre-solve the 
MLE decoding IP for each of the 2 (n ~ 1 ^ 2 distinct bit- 
flip syndromes. One can then iterate through each of the 
2 n distinct error patterns, compute its syndrome, and 
determine whether the combination of the error pattern 
plus the inferred correction by the IP leads to a logical 



operator, indicating failure of the decoding algorithm. 
Since error-correction is assumed to be error-free in this 
noise model, the corrected state is guaranteed to be in 
the codespace. Because (a) the logical bit-flip operator 
is transversal, (b) all stabilizer group elements have even 
weight, and (c) there are an odd number of qubits in 
every triangular code, it follows that one can identify a 
decoding failure quickly by computing whether the parity 
of the error pattern equals the parity of its IP-inferred 
correction; this means that is suffices to just store the 
parity of the inferred correction for each pre-computcd 
IP instance. The probability of failure, pf a n is therefore 

pmi= £ pW(i- P r-w, (19) 

failing patterns E 

where \E\ denotes the Hamming weight of the bit-flip 
error pattern E. 

We carried out this tabulation for the smallest trian- 
gular 4.8.8 color codes of distances 1, 3, 5, and 7 (corre- 
sponding to 1, 7, 17, and 31 qubits respectively) and com- 
puted the corresponding exact polynomials. To speed up 
the computation, we used several symmetries. For ex- 
ample, it suffices to examine only half of the error pat- 
terns because if the decoding algorithm succeeds on an 
error pattern, it fails on its complement and vice versa. 
Also, up to overall complementation, every error pattern 
can be uniquely expressed as the modulo-2 sum of an 
IP-inferred minimal-weight error pattern and a pattern 
where a bit-flip stabilizer group element has support. Fi- 
nally, the decoding algorithm is guaranteed to work on 
all errors whose weight is less than the code's distance, 
so those error patterns do not need to be examined. 

The formulas we obtained for the smallest codes of 
distance 1, 3, and 5 (code sizes 1, 7, and 17) are: 



p 




(20) 


p 7 + 7p 6 (l-p) + 28p i (l-pf 


i 


(21) 


f 7p 3 (l-pf + 21p 2 (l-pf 




(22) 


p 17 + I7p 16 (l-p) + 136p 15 (l 


-Pf 


(23) 


+- 348p 14 (l - pf + 725p 13 (l - 


Pf 


(24) 


+- 3861p 12 (l - pf + 4764p n (l 


-pf 


(25) 


+- 12136p 10 (l - p) 7 + 9747p 9 (l 


-pf 


(26) 


+- 14563p 8 (l - pf + 7312p 7 (l - 


-P) W 


(27) 


-1- 7612/(1 - p) 11 + 2327p 5 (l - 


-P) 12 


(28) 


+- 1655p 4 (l - pf 3 + 332p 3 (l - 


Pf 4 - 


(29) 



The formula we obtained for the distance-7 triangular 
4.8.8 color code (31 qubits) is a bit more hefty: 
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PfJi = P 31 + 31p 30 (l -p)+ 465p 29 (l - p) 2 + 4495p 28 (l - pf + 25658p 27 (l - pf + 96790p 26 (l - pf (30) 
+ 344858p 25 (l - pf + 1288630p 24 (l - p) 7 + 3742943p 23 (l - pf + 10488241p 22 (l - pf 
+ 21436239p 21 (l - p) 10 + 44259329p 20 (l - pf 1 + 67781868p 19 (l - p) 12 + 106951476p 18 (l - p) 13 
+ 127137964p 17 (l - p) 14 + 155845748p 16 (l - p) 15 + 144694447p 15 (l - p) 16 + 138044561p 14 (l - p) 17 
+ 99301599p 13 (l - p) 18 + 73338657p 12 (l - p) 19 + 40412986p u (l - pf° + 22915926p 10 (l - pf 1 
+ 9671834p 9 (l - pf 2 + 4145782p 8 (l - p) 23 + 134 9 45p 7 (l - pf 4 + 391423p 6 (l - p) 25 + 73121p 5 (l - pf 6 
+ 5807p 4 (l -p) 27 . 

I 



Our computing resources did not allow us to compute 
the exact polynomial for the next-sized code (distance 
9 code on 49 qubits), so we resorted to a Monte Carlo 
estimate for Pf a ii(p). We did this by first selecting three 
values of p near where we believed the threshold to be. 
For each p, we generated N trial error patterns drawn 
from the Bernoulli distribution, namely in which we ap- 
plied a bit-flip on each of the n qubits with probability 
p. We then inferred the syndrome for each error pattern 
and checked whether or not it led to a decoding failure 
for the MLE decoder. The optimal unbiased estimator 
for pf a ii that we used is 



(est) _ 



iVi 



Phil 



fail 



with a variance of 



(Jl \(est) 
Kail) 



(est; 
Pfeil 



N 



(31) 



(32) 



To get reasonably small error bars in these estimates, 
given where we believed the threshold to be, we chose 
N = 10 5 . The polynomials for Pf a n(p) are plotted in 
Fig. 10, including our three points of Monte Carlo data. 
From these plots, we estimate the accuracy threshold for 
this noise model to be 10.56(1)%. The error we report 
in this value comes from the error analysis method we 
describe in detail in the next section. 

To put our result in context, we reference Table I. 
The threshold value of 10.56(1)% we find is is slightly 
higher than the corresponding MLE threshold for the 
code capacity 10.31(1)% of 4.4.4.4 surface codes. In- 
tuitively this makes sense, as the 4.8.8 color code has 
both weight-8 and weight-4 stabilizer generators, both 
of which are modeled as being measured instantaneously 
and ideally. Being able to measure high-weight genera- 
tors quickly should improve the performance of a code, 
which is the effect we observe. 

Our threshold is also less than the threshold value of 
10.925(5)% for optimal decoding, which is also not sur- 
prising. As with the 4.4.4.4 surface codes, the reduction 
in threshold is not very significant. For both the surface 
codes and the 4.8.8 color codes, the accuracy thresh- 
old in the code capacity noise model corresponds to a 
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FIG. 10: Code capacity for the 4.8.8 triangular color codes. 
Pth = 10.56(1)%. Error bars on Monte Carlo data reflect 10 5 
instances studied at each of the three corresponding values of 
p. The inset figures are zoom-ins near the crossing point to 
show greater resolution there. 



phase transition in a random-bond Ising model (RBIM) 
of classical spins [16, 23]. For the color codes, the Ising 
model features 3-body interactions, whereas for the sur- 
face codes, the Ising model features 2-body interactions. 
The MLE decoder in both settings corresponds to the 
order-disorder transition in the spin model at zero tem- 
perature, whereas the optimal decoder corresponds to 
the order-disorder transition at the temperature along 
the so-called "Nishimori line," where the randomness in 
the bond couplings equals the randomness in the state 
arising from finite temperature fluctuations. In both the 
surface-code and color-code settings, the small decrease 
in accuracy threshold when going from optimal to MLE 
decoding reflects that the phase-boundary in these mod- 
els is re-entrant, but only by a small amount. Our results 
therefore imply a violation of the so-called Nishimori con- 
jecture [53, 54], which conjectures that the spin model 
shouldn't become more ordered as the temperature in- 
creases. The violation that our results imply is depicted 
in cartoon fashion in Fig. 11. To our knowledge, the vio- 
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lation of the Nishimori conjecture for the 3-body RBIM 
is unknown before our work. We expand more on this 
connection in Sec. VII B. 



Disordered 
Phase 



Algorithm 1 : Pf a n(p) by Monte Carlo 




FIG. 11: Phase diagram for 3-body random-bond Ising 
model. The dark circle is called the Nishimori point. The 
dotted line is the expected phase boundary given by the Nishi- 
mori conjecture. Our value of code capacity (10.56(1)%) es- 
tablishes that the T — intercept is P c ,o, while results of 
Ohzeki [25] (10.925(5)%) establish that the Nishimori point 
occurs at P c . Because Pc 7^ P c ,o, the Nishimori conjecture 
for this model is false. 



B. Phenomenological noise model 

In the phenomenological noise model, our fault- 
tolerant quantum error correction protocol repeats syn- 
drome extraction multiple times to increase the reliability 
of the syndrome bits. This causes the number of possi- 
ble error patterns for a given code size to grow so rapidly 
that obtaining exact curves for pt&ti(p) even for small code 
sizes is intractable. We therefore resorted to Monte Carlo 
estimates for these curves for even the smallest code sizes. 
The specific Monte Carlo algorithm we used for comput- 
ing pf a ii at a fixed value of p is listed in Algorithm 1. 

In words, Algorithm 1 creates an estimator for p^n 
by assessing the performance of many simulated trials of 
faulty quantum error correction. In each trial, errors are 
laid down, giving rise to an observed syndrome history. 
From the syndrome history, a correction is inferred. The 
actual error history and the inferred error history are 
XORed onto a single effective time slice, but the state in 
this effective time slice is not necessarily in the codcspace. 
To achieve this, a fictional ideal (error-free) round of error 
correction is simulated. If this succeeds (i.e., if it does 
not generate a logical bit- flip operation), then the trial 
is deemed a success; otherwise it is deemed a failure. By 
repeating many trials, one obtains an optimal unbiased 
estimator for the failure probability p^n , with mean and 
variance given by Eqs. (31-32), identical to the formulas 
relevant in the code capacity noise model setting. 

Our plots of pf a ii versus p for small-distance color codes 



1: n facC s 4- \{d + if ~ 
2: for i = 1 to N do 



/ / Generate data and syndrome errors for d time slices. 
for t — 1 to d do 

for j = 1 to n do 

E[t,j] 4- 1 with probability p. // Data errors. 

end for 

for j = n + ltora+l + rtfacos do 

E[t,j] 4- 1 with probability p. // Synd. errors. 
end for 
end for 

-Emm 4— Decode(Syndrome(_B)). // 3D error volume. 

E' 4- ( E[t] © E min [t]. 1 1 2D error plane. 

E' m i n 4- Decode(Syndrome(£'')). // JdeaJ decoding. 

if (©i^'M® £w»[i] = l) then 

Ar fai i 4- AT fail + 1. 
end if 



18: end for 

19: return = AT fail /JV. 



are depicted in Fig. 12. Just as for surface codes, the 
phenomenological noise MLE decoder can be mapped 
to a random-plaquette gauge model (RPGM) on clas- 
sical spins such that the zero-temperature order-disorder 
phase transition in the spin model corresponds to the ac- 
curacy threshold of the color codes. Because of this, as 
argued in Ref . [27] , the mutual intersection of the curves 
in Fig. 12 at the threshold p c corresponds to critical be- 
havior in the spin model such that the spin correlation 
length £ scales as 



£ ~ \p ~ Pc 



(33) 



where vq is a critical exponent set by the universality 
class of the spin model. 
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FIG. 12: Monte Carlo data used to estimate the accuracy 
threshold in the phenomenological noise model. 



13 



For a sufficiently large code distance d, then, the failure 
probability should scale as 

Pt^ = {p-Pc)d 1/W °. (34) 

We use our Monte Carlo data to fit to this form, but 
as in Ref. [27], we allow for systematic corrections com- 
ing from finite-size effects that create a constant offset. 
Specifically, we use the method of differential corrections 
[55] to fit the curves to the form 

p fai] = A + Bip-p^d 1 '" . (35) 

The linear fits to our data are plotted in Fig. 13. Using 
the software of Ref. [55], we found the following values 
for p c and i/q ■ 

p c = 0.030 534 ± 0.000 385 (36) 
v = 1.486 681 ± 0.166 837. (37) 



C. Circuit-level noise model 

As with the phcnomcnological noise model, computing 
Pfaii(p) exactly even for small code sizes is intractable, so 
we again appeal to Monte Carlo estimation. Our Monte 
Carlo simulation algorithm is similar to Algorithm 1, ex- 
cept the manner in which the error pattern E is generated 
is different. To generate E, we simulate BP and DP chan- 
nel noise as described by the noise model on the explicit 
circuit given for syndrome extraction. This results in a 
correlated error model for syndrome and data qubits. We 
then use the phenomenological noise MLE decoder and 
assess success or failure as we did for that noise model. 

We estimated the Pf a ii(p) curves for several small 4.8.8 
triangular color codes for both the A-then-Z schedule of 
Fig. 6 and the interleaved X-Z schedule of Fig. 7. Our 
results are plotted in Figs. 14 and 15. 
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FIG. 13: Linear fit near curve crossings of phenomenological- 
noise-model Monte Carlo data. Estimated accuracy threshold 
is p th = 3.05(4)%. 

To put our results in context, as we did in the code ca- 
pacity setting, we reference Table I. For the same reasons 
as in the code capacity noise model setting, the threshold 
we compute is larger than the MLE decoder's threshold 
for the 4.4.4.4 surface codes. We conjecture that is it 
also measurably less than the threshold for the optimal 
color-code decoder, as is the case for optimal vs. MLE 
decoding for surface codes. So far, the threshold for opti- 
mal decoding of 4.8.8 color codes has not been estimated, 
but the analysis for optimal decoding of 6.6.6 color codes 
suggests that the threshold will be near 4.5%. If true, our 
data would signal a violation of the Nishimori conjecture 
for the RPGM associated with the 4.8.8 color code, some- 
thing we are not aware of being reported elsewhere. 

Finally, we note that while the value of Uq is consistent 
with value of i/q = 1.463(6) obtained for the 4.4.4.4 sur- 
face code [27] and the 6.6.6 color code, the uncertainty in 
the value we obtained is too high to draw any meaningful 
conclusions. 



FIG. 14: Monte Carlo data used to estimate accuracy thresh- 
old in the circuit-based noise model in which the noninter- 
leaved syndrome extraction circuit is used. 

To compute the accuracy thresholds from our data, 
we again fit our data near the crossings to an equations 
whose form is similar to that of by Eq. (35). However, the 
motivation for such a fit is a bit more tenuous in this case 
because while the MLE decoder we are using maps to a 
RPGM, the noise model which generates it is correlated. 
For this reason, as also found in Ref. [27], we found it 
necessary to include a quadratic term, unlike the case for 
the pure phenomenological noise model. In other words, 
we fit our data to an equation of the form 

p laa = A + B(p- Vc )d x l va + C( P - p c fd 2 ' va . (38) 

The quadratic fits to our data for the A-then-Z sched- 
ule are plotted in Fig. 16. Again using the software of 
Ref. [55] , we found the following values for p c and vq for 
the A-then-Z schedule: 

p c = 0.000 820 ± 0.000 022 (39) 
v = 1.350 954 ± 0.079 188. (40) 
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FIG. 15: Monte Carlo data used to estimate the accuracy 
threshold in the circuit-based noise model in which the inter- 
leaved syndrome extraction circuit is used. 



To be clear, there is both a Z-error and an X-error 
accuracy threshold; we report the smaller of the two here. 




Error 



FIG. 16: Quadratic fit near curve crossings of noninterleaved- 
circuit circuit-based-noise-model Monte Carlo data. Esti- 
mated accuracy threshold is p t h = 0.082(3)%. 

Similarly, for the XZ-intcrlcavcd schedule we found 



Pc = 0.000 800 ± 0.000 037 
v Q = 1.509 871 ± 0.151 690. 



(41) 
(42) 



To remind, our results are for the smaller of the X- 
error and Z-error thresholds. 

Our results show that despite our efforts to shorten the 
schedule of the syndrome extraction circuit, the impact 
on the resulting accuracy threshold is essentially indis- 
tinguishable. The value of 0.082(3)% for the accuracy 
threshold for MLE decoding of the 4.8.8 color codes in the 



circuit-level noise model is about a factor of ten less than 
the the corresponding 0.68% accuracy threshold for MLE 
decoding of 4.4.4.4 surface codes in the circuit-level noise 
model. We believe that the difference comes from the fact 
that the 4.8.8 codes have some weight-8 stabilizer genera- 
tors while the 4.4.4.4 codes only have weight-4 stabilizer- 
generators. This causes the circuits for extracting the 
syndrome for the weight-8 generators in the 4.8.8 codes 
to be larger, inviting more avenues for failure. Indeed, we 
have investigated the finite-sized error-propagation pat- 
terns for the 4.8.8 codes such as the one depicted in Fig. 8, 
and they are significantly larger and more complex than 
the corresponding patterns for the 4.4.4.4 surface codes. 
Expanding this line of reasoning, we predict that the 6.6.6 
color codes will have an MLE-decoded accuracy thresh- 
old in the circuit-based noise model that is somewhere 
between the 4.8.8 and 4.4.4.4 accuracy thresholds in this 
noise model. 



V. ANALYTIC BOUND ON THE ACCURACY 
THRESHOLD FOR FAULT-TOLERANT 
QUANTUM ERROR CORRECTION 

While numerical estimates of the accuracy threshold 
are valuable, equally valuable are analytic proofs that the 
accuracy threshold is no smaller than a given value. One 
method of obtaining such a lower bound is to use the self- 
avoiding walk (SAW) method, first proposed in Ref. [16]. 
The idea behind this method begins with the observation 
that our goal is to lower-bound the failure probability of 
decoding, which is the probability that the actual errors 
plus the inferred correction (modulo 2) lead to an error 
chain that corresponds to a logical operator. For color 
codes, logical operators can be not only string-like but 
also string-net like, as described in the original paper on 
color codes [20]. They must also have a Pauli- weight at 
least as large as the distance of the code. The probability 
that a logical operator is present in the post-corrected 
state is therefore at least as large as the probability that 
an error-chain string of Pauli-weight equal to the code 
distance is present. Certainly this is a very pessimistic 
bound; there are many error chain strings and string-nets 
of this Pauli weight that do not result in failure! 

The SAW lower-bound method can be applied rela- 
tively straightforwardly to the code-capacity and phc- 
nomenological noise models with MLE decoding. The 
method begins to break down when applied to the circuit- 
level noise model with phenomenological MLE decoding. 
One reason for this is that the circuit introduces corre- 
lated errors, called "hooks" in Ref. [16], which suggest 
that the SAW bounding the failure probability should be 
allowed to sometimes take more than one step in a sin- 
gle iteration. With some finesse, this can be accounted 
for and bounded as in Ref. [16]. However, for the color 
codes, the steps need not be path-connected either. For 
example, the circuit may create three separated errors 
on a single octagon plaquette. Calling such a process 
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a "walk" or attempting to bound the behavior of the 
process by a true SAW method is dubious at best. For 
this reason, we have chosen to omit bounding the accu- 
racy threshold in the circuit-level noise model and instead 
have bounded the accuracy threshold only for the other 
two noise models, as described below. 



Fig. 17. To our knowledge, the connective constant for 
this lattice is not known, but it could be computed in 
principle using standard methods, e.g., those outlined in 
Refs. [56-58]. We opted to bypass this analysis and in- 
stead compute a coarser bound on the failure probability. 



A. Code capacity noise model 

As argued by Dennis et al. in Ref. [16], the probability 
that an [n, k, dj topological code decoded by an error-free 
MLE decoder fails is upper-bounded by the probability 
that a self-avoiding walk creates a closed path (i.e., a 
self-avoiding polygon or SAP) of length d or greater: 



Pfail 



< 



^Prob SA p(d) 



L>d 



<n^n SA p(L) (4p(l-j>)) 



L/2 



(43) 
(44) 



L>o 



Self-avoiding walks on the 4.8.8 lattice have been stud- 
ied, and it is known that the number of self-avoiding poly- 
gons of length L on the lattice scales asymptotically as 
[56] 



n SAP (L) < P(L)vi s . s , 



£t 4 .8.8~l-808 830 01(6), (45) 



where P is a polynomial and £44.8.8 is the so-called con- 
nective constant for the 4.8.8 lattice. (The value £44.8.8 
has been rigorously bounded to be 1.804 596 < £44.8.8 < 
1.829 254 [57, 58].) For small p, each summand in 
Eq. (43) is upper-bounded by the term with L = d, and 
the number of summands is at most a polynomial in d, 
so that pf a ii — ¥ as d — > 00 as long as 



P (i 



1 



(46) 



8.8 



Solving this equation for p, we find that the code ca- 
pacity threshold is at least 



p c > 8.335 745(1)%. 



(47) 



Despite the crudeness of the SAW bound, it comes 
surprisingly close to the numerical value of 10.56(1) that 
we estimate in Sec. IV A. 



B. Phenomenological noise model 

The SAW bound method is essentially the same as for 
the code capacity noise model, except now errors can hap- 
pen on syndrome qubits as well as data qubits and the 
set of all relevant qubits forms a three-dimensional vol- 
ume. The relevant SAW traverses a 3D lattice that con- 
nects syndrome qubits and data qubits both with them- 
selves and each other as dictated by the color code; the 
corresponding nonregular prismatic lattice is depicted in 




FIG. 17: Prismatic lattice on which a self-avoiding walk oc- 
curs in the analysis of the accuracy threshold for fault-tolerant 
quantum error correction using color codes in the phenomeno- 
logical noise model. 

Because the lattice in Fig. 17 has vertices of degree A 
equal to 6, 8, and 10, we can bound the number of SAPs 
of length L by 



nsAp{L) < 2A max (2A 

n 



1 



(48) 



Using A max = 10, we obtain a formula similar to that of 
Eq. (46), namely 



p(l-p) < 



1 



4(9) 2 



1 

324' 



(49) 



Solving this equation for p, we find that the phe- 
nomenological noise threshold is at least 



Pi 



> 



0.3096%. 



(50) 



This bound is nearly a factor of ten less than the value 
of p c = 3.05(4)% that we estimate in Sec. IV B. With 
further computational effort in determining the connec- 
tive constant of the governing lattice, we suspect that the 
SAW bound will still be below our numerical estimate, 
but significantly closer, in analogy with the relationship 
between our SAW bound for the code capacity and the 
value we estimate numerically. We leave this analysis to 
others wishing to tighten this bound. 
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VI. FAULT-TOLERANT COMPUTATION WITH 
COLOR CODES 

To establish a threshold for fault-tolerant quantum 
computation, it is sufficient to establish three things: 1) a 
threshold for fault-tolerant quantum error correction, 2) 
a procedure for performing a universal set of gates in en- 
coded form, and 3) that a failure in an encoded gate that 
occurs with probability p leads to failures in each output 
codeword with probability at most p. These three ingre- 
dients establish that each gate in a quantum circuit can 
be simulated fault-tolerantly by performing it in encoded 
form followed by fault-tolerant quantum error correction. 
We previously established the first criterion in Sec. III. 
We establish the second two criteria here for two possible 
computer architectures. 

In the first, which we call the "pancake architecture," 
each logical qubit is stored in its own triangular 4.8.8 
color code and the logical qubits are stacked atop one 
another. This architecture is essentially the same as the 
one proposed in Ref. [16]. Almost all encoded opera- 
tions are implemented transversally in this model, acting 
on single "logical qubit pancakes" or between two such 
"pancakes." In the second, which we call the "defect 
architecture," each logical qubit is stored as a connected 
collection of missing check operators, which we call a "de- 
fect," in a single 2D 4.8.8 substrate. This architecture is 
essentially the same as the one proposed in Ref. [17]. 
Almost all encoded operations are performed in one of 
two ways: encoded single-qubit gates are performed by 
disconnecting a region containing the defect, operating 
transversally on the region, and reconnecting the region, 
while the encoded CNOT gate is performed by a sequence 
of local measurements that cause one defect to circulate 
around another. 



A. Fault-tolerance by transversal gates 

In this section, we compute the threshold for fault- 
tolerant quantum computation with triangular 4.8.8 color 
codes when (almost) all encoded gates are implemented 
transversally. To remind, by calling a gate "transversal," 
we mean that it acts identically on all physical qubits in 
a code block. For example a two-qubit transversal gate 
between two triangular codes acts as the same two-qubit 
physical gate between corresponding physical qubits in 
each code block. Some authors refer to this notion of 
transversality as strong transversality [59]. 



IV) —BP— QEC 



FIG. 18: Noisy identity gate. BP indicates the action of the 
BP channel. 



Formally, we can express the equivalence between the 
accuracy threshold for the identity gate and the accuracy 
threshold for fault-tolerant quantum error correction as 



M) _ (QEC) 

Pth — Pth 



2. CNOT gate 



(51) 



The color codes are Calderbank-Shor-Steane (CSS) 
codes [47, 48], and for all such codes, the encoded 
controlled-NOT (CNOT) gate can be implemented 
transversally, namely by applying CNOT gates between 
corresponding pairs of physical qubits in two color codes. 
(For color codes, fewer CNOT gates than a fully transver- 
sal set also suffice.) Schematically, Fig. 19 depicts a noisy 
CNOT gate. Each physical CNOT gate propagates the 
BP channel on its control to the BP channel on its target 
and vice versa, so that the effective noise model seen by 
the fault-tolerant quantum error correction procedure on 
each code block after the encoded CNOT gate is the BP 
channel followed by the projection of the two-qubit DP 
channel onto a single qubit. Although the DP channel 
can create correlated errors between output code blocks, 
it will never cause a correlated error within a code block. 
Since our decoder treats the noise model phenomenolog- 
ically, it does not account for DP-channel features such 
as the fact that in the DP channel a Y error is more 
probable than the combination of separate X and Z er- 
rors. For this reason, since half of the DP-channel errors 
act as a bit-flip on a given code block and half of them 
act as a phase-flip on a given code block, our decoder 
interprets the post- CNOT noise model as a BP chan- 
nel with an effective error rate of p + p/2 for bit flips 
and p + p/2 for phase flips. This means that the ac- 
curacy threshold for the CNOT gate is actually 2/3 of 
the value for the identity gate. The CNOT gates used 
in an encoded CNOT gate must therefore meet a more 
stringent requirement than the identity gate to be im- 
plemented transversally fault-tolerantly. (However, the 
CNOT gates used in fault-tolerant quantum error correc- 
tion still only need to meet the threshold for the encoded 
identity gate.) 



(cnot) _ 2 m 

Pth ~ ^Pth 



(52) 



1. Identity gate 



The accuracy threshold for the identity gate is exactly 
the same as the accuracy threshold for fault-tolerant 
quantum error correction, by definition. Schematically, 
Fig. 18 depicts the noisy identity gate circuit. 



3. Hadamard gate 

The color codes are strong CSS codes, meaning that 
the X-type and Z-type stabilizer generators have the 
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FIG. 19: Noisy CNOT. BP indicates the action of the BP 
channel; D indicates the action of the DP channel. 



FIG. 21: Noisy phase gate. B indicates the action of the bit- 
flip channel; P indicates the action of the phase-flip channel. 



same structure. As with all strong CSS codes, the en- 
coded Hadamard gate (H) can be implemented transver- 
sally. 

Like the CNOT gate, the Hadamard gate propagates 
the BP channel to the BP channel. However, since faults 
in the Hadamard gate are modeled as an ideal Hadamard 
gate followed by the BP channel, the effective noise model 
is not one but two actions of the BP channel, as depicted 
in Fig. 20. 
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FIG. 20: Noisy Hadamard BP indicates the action of the BP 
channel; D indicates action of the DP channel. 

It is straightforward to show that two successive ap- 
plications of the BP channel with probability p is equiv- 
alent to one application of BP channel with probability 
2p(l — p). This is therefore the effective post- Hadamard 
noise channel, so that the threshold for the Hadamard 
gate is about half of that for fault-tolerant quantum er- 
ror correction: 

vW = \-\\ll-*%"\p%- (55) 
4- Phase gate 



is p 3 + 3p(l — p) 2 . The phase gate thus has separate 
thresholds for bit-flip and phase-flip noise. For bit-flip 
noise, the threshold is 

t&^ = \-\^-^™\$- (59) 

For phase-flip noise, one must solve a cubic equation to 
get a closed-form solution for the threshold as a function 
of the threshold for the identity gate. While this is pos- 
sible in principle, to save space we simply state the cubic 
equation in the variable x = p(^>P has °- fll P) must be 
solved and its approximate solution, which we can es- 
timate because we know that the accuracy threshold is 
very close to 0: 

x 3 + 3x(l-x) 2 =p i t I h ) , (60) 
x*~pg\ (61) 

5. Single-qubit measurements 

To destructively apply the encoded single-qubit mea- 
surements Mx and Mz , we transversally measure X or Z 
on each of the qubits in the code block. We then perform 
classical error correction on the measurement outcomes 
(because they may be faulty) to infer the outcome of 
the encoded measurement, as depicted schematically in 
Fig. 22. 



The color codes have the feature that each stabilizer 
generator for the code has a Pauli weight equal to mod 
4 and each pair of generators are incident on mod 2 
qubits. One can show that because of this, the encoded 
phase gate (S) has a transversal implementation [20, 60]. 
(Technically, it is the transversal operation that acts 
as an encoded S.) 

While a faulty phase gate acts as an ideal phase gate 
followed by a BP channel, the phase gate itself does not 
propagate the BP channel preceding it symmetrically for 
bit flips and phase flips. This follows from the conjuga- 
tion actions 



SXS t = Y = iXZ 



SZS^ 



(56) 



The phase gate therefore propagates a phase flip to a 
phase flip and a bit flip to both a bit-flip and a phase 
flip, as depicted in Fig. 21. 

The phase gate is correspondingly more sensitive to 
phase-flip noise because the effective phase-flip strength 
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FIG. 22: Noisy measurements. B denotes the bit-flip chan- 
nel, P denotes the phase-flip channel, and CEC denotes clas- 
sical error correction of the measurement outcomes. Post- 
measured states are drawn with double lines to indicate that 
they are "classical." 

The correctness of this procedure follows from the fact 
that X and Z operators can be expressed as Z = S 2 
and X = HZH, and the encoded operations H and S 
have previously been demonstrated to have transversal 
encoded implementations. Bit or phase errors (as rele- 
vant) before a measurement then map to bit errors on 
the observed classical bit pattern. 

The reason the measurement is destructive is that af- 
ter the measurement, the qubits are no longer in the 
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codespace of the color code; the post-measured state is 
not projected onto an X or Z eigenstate in the codespace. 
However, as pointed out by Steane [46], given the abil- 
ity to prepare encoded |+) states, a circuit composed of 
transversal CNOT and transversal destructive Mj mea- 
surements can implement nondestructive Mx measure- 
ments transversally. A similar story holds for encoded 
|0) states and Mz measurements. The circuits for gen- 
erating these nondestructive measurements transversally 
are depicted in Fig. 23. Because the encoded |0) and 
|+) states are being used to enable gates, namely nonde- 
structive encoded measurements, these states are called 
"magic states" for the gates [61]. Ordinarily, quantum er- 
ror correction would follow not just one, but both of the 
outputs of the encoded CNOT gate in these circuits, but 
because one of the encoded qubits is destructively mea- 
sured immediately after the CNOT gate, that encoded 
qubit does not require quantum error correction; it will 
be effectively performed by the classical error correction 
process occurring after the destructive measurement. 
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FIG. 23: Circuits for nondestructive encoded Mx and Mz, 
using the states |0) and |+) as "magic states." 

The threshold for destructive Mz and Mx measure- 
ments is the same as the code capacity threshold for the 
code, regardless of which noise model we are considering. 
This is because the physical measurements are made only 
once, as repetition cannot improve their effective error 
rate. The (flawless) classical error correction performed 
in post-processing has a threshold equal to the code ca- 
pacity threshold. Hence, we have the result that 
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Although these measurements need only be smaller 
than the code capacity threshold to implement the en- 
coded measurement, when these measurements are used 
in the fault-tolerant quantum error correction protocol, 
they must be smaller than the threshold set by the pre- 
vailing noise model — a threshold that may be signifi- 
cantly lower. 

To compute the threshold for nondestructive Mz and 
Mx measurements, we examine how errors propagate 
through the circuits in Fig. 23. As with the analysis of 
Fig. 19, the effective noise channel we need to consider af- 
ter the CNOT gate is the BP channel followed by the DP 
channel on each output. One of these enters a destruc- 
tive measurement, which, as we found in the analysis of 



Fig. 22, has a rather high threshold equal to the code 
capacity even in the circuit-level noise model. However, 
it is lowered slightly by the fact the effective error rate 
is |p, as discussed in the analysis of the encoded CNOT 
gate. The other output enters a standard quantum error 
correction circuit, also subject to noise of strength |p. 
Since the lowest threshold of these two thresholds is this 
one, the overall threshold for an encoded nondestructive 
measurement is the same as the threshold for the encoded 
CNOT gate. Namely, we have the result that 
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6. |0) and +) preparation 

It is tempting to assert that the way to fault-tolerantly 
prepare the encoded |0) state is to perform an encoded 
nondestructive Mz measurement. The flaw with this 
reasoning is that the nondestructive Mz measurement 
requires the encoded |+) state as a magic state, and the 
analogous way of preparing a |+) state requires a |0) 
state. 

To get out of this chicken-and-egg cycle, one must use 
an independent process. We describe a two-step process 
that works for preparation of an encoded |0) state; the 
process for preparing an encoded |+) state is similar. 

The first step is to prepare the product state |0)® n by 
transversally measuring Mz on each physical qubit. This 
state is a stabilizer state, having n check operators, with 
check operator i being Z on qubit i for i = 1, . . . n. The 
second step is to fault-tolerantly measure the X checks 
for the color code. Because the only Z-type operators 
consistent with all the X checks are the color codes' Z 
checks for the color code and the logical Z operator, these 
measurements will transform the state into the logical |0) 
state. 

It turns out that it is not necessary to also fault- 
tolerantly measure the Z checks for the color code. The 
state is already in an eigenstate of these operators at 
this point, so all the measurements can do is yield syn- 
drome bits. Had one obtained these bits and processed 
them, the post-corrected state would still have been sub- 
ject to X errors drawn from the same distribution as the 
X errors afflicting the initial |0)®" preparation — fault- 
tolerant error correction doesn't suppress the final error 
rate to zero, it only keeps it at the same rate one started 
with. 

The threshold for preparation of encoded |0) and |+) 
states is therefore the same as the threshold for fault- 
tolerant quantum error correction, namely, 
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It is worth noting that while the process for fault- 
tolerantly preparing |0) and |+) states is not strictly 
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transversal, the only nontransversal operation is fault- 
tolerant quantum error correction, a process that is re- 
quired in addition to transversal operations in any event 
in order to achieve fault-tolerant quantum computation. 



then "distilled" using encoded gates until the resultant 
| tt/4) states have an error below the accuracy threshold. 
In the second, high-quality |7r/4) states are first distilled 
and then injected into the code. The circuit depicted in 



7. T gate 

Another gate that admits a transversal implementation 
with a magic state is the T gate, also called the tt/8 gate, 
defined as 
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If we we have an encoded version of the state 
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also called \A) and I-A^m) in the literature, we can im- 
plement the T gate transversally using the circuit of 
Fig. 24. This circuit is not a Clifford circuit, because the 
classically-controlled S gate is not a Clifford gate. Never- 
theless, it only uses gates that we have previously shown 
how to implement in encoded form by purely transversal 
operations. 
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FIG. 24: Magic-state circuit for the T gate. 

To compute the T gate threshold, we again study error 
propagation through its defining circuit, viz.the circuit in 
Fig. 24. As shown previously, the CNOT gate creates an 
input to the first QEC cycle that has a threshold of 2/3 
of the standard QEC threshold. The S gate creates an 
input to the second QEC cycle which splits the thresh- 
old into bit-flip and phase-flip thresholds approximately 
equal to 1/2 and 1/3 of the standard QEC threshold. 
The threshold for the T gate is set by the smallest of 
these, namely the S gate threshold, which is 
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8. 7r/4) preparation 



There are two alternatives for preparing encoded 1 7r/4) 
states fault-tolerantly described in the literature. In the 
first, low-fidelity |7r/4) states are "injected" into the code 
by teleportation, using the circuit in Fig. 25 [10], and 
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FIG. 25: Circuit for injecting a single-qubit magic state M. 
The circuit for multi-qubit magic states is similar. 

Fig. 25 is not fault-tolerant, but faults are already sup- 
pressed by the code on the encoded qubits; only opera- 
tions from the latter-half of the decoding circuit onwards 
arc unprotected. 

Unlike all of the previous encoded gates, this method 
for implementing an encoded |7r/4) preparation requires 
an operation which is neither transversal nor fault- 
tolerant quantum error correction. The "unencoding" 
portion of the circuit is the time-reversed coherent cir- 
cuit for encoding a state in the color code, derivable via 
standard stabilizer codes as shown in Ref. [50]. This 
unencoding circuit does not appear to have a transver- 
sal implementation. While the Eastin-Knill theorem [62] 
asserts that at least one nontransversal operation is re- 
quired to generate a universal set of encoded gates, it 
does not guarantee that no transversal implementation 
of this circuit exists. That is because the process of fault- 
tolerant quantum error correction used to prepare |0) and 
|+) states is not transversal. For 3D color codes [63], 
in which T is intrinsically transversal and in which en- 
coded |0) and |+) states still require fault-tolerant quan- 
tum error correction for preparation, only transversal and 
FTQEC operations are needed, for example. It would be 
interesting to develop a variant of the circuit in Fig. 25 
which only uses transversal operations and possibly fault- 
tolerant quantum error correction to inject a |7r/4) state 
into 2D color codes. We leave that for others to explore. 

While the portion of the circuit in Fig. 25 in which the 
physical \M) state interacts with the unencoded qubit 
via a CNOT appears to also not be transversal, it can 
be made so with slight modification. In principle, one 
could prepare n states of the form \M) and transversally 
apply the CNOT gate between these and the code block, 
but only the one qubit corresponding to the unencoded 
state will be used to classically control the X and Z gates 
that are used to inject the correct state. As usual, these 
corrections do not need to be actually implemented, only 
used to update the Pauli frame. 

Both alternatives for preparing high-quality encoded 
|7r/4) states require a procedure for magic-state distil- 
lation. One option is to use the encoding circuit for 
the 15-qubit Reed-Muller code [6] (also the smallest 3D 
color code [63]) run in reverse, as depicted in Fig. 26. 
For it to work, the initial states must have an error less 
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than the |7r/4) distillation threshold. For the circuit de- 
picted in Fig. 26, the distillation threshold for indepen- 
dent, identically distributed (iid) depolarizing noise is 
(6 - 2\/2)/7 « 45.3% [64, 65], for dephasing iid noise 
is (V2- l)/\/2 « 29.3% [64, 66], and for worst-case iid 
noise is (y/2-l)/2y/2 w 14.6% [64, 66]. The entire circuit 
must be run 0(poly(e -1 )) times to achieve an output 
error less than e; convergence should be quite rapid in 
practice given the actual polynomial [64]. Various tricks 
can be used to boost the distillation threshold and reduce 
the resources required to achieve high-fidelity states; any 
of these can be readily adapted to this setting. 
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FIG. 26: Distillation circuit for 7r/4) states; it is the 15-qubit 
Reed-Muller code's encoding circuit in reverse. 



9. Synthesis 

It is well-known result the gate basis 
{H, S, CNOT, M X ,M Z , |0), |+), |tt/4)} is universal 
for quantum computation [36] (in fact, it is even over- 
complete). We have presented transversal methods for 
performing color-code encoded versions of each of these 
except for the state preparations. By the Eastin-Knill 
theorem [62] , it is impossible to generate a complete uni- 
versal encoded gate basis in transversal form. However, 
color codes offer a particularly gentle way around this 
theorem. There are only two nontransversal operations 
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used. The first is fault-tolerant quantum error correc- 
tion, a process that is required in addition to encoded 
computations in any event for the entire protocol to 
be fault tolerant. The second is the time-reversed 
coherent encoding circuit for color codes. Such a circuit 
is useful for encoding unknown quantum states, but 
in an actual quantum computation, the input state is 
known so it is not needed for this purpose. Whether 
this "unencoding circuit" can be replaced with another 
operation which uses only transversal operations and 
fault-tolerant quantum error correction is an interesting 
open question. For 3D color codes, we know that the 
answer is "yes." 

The "pancake architecture," described in Ref. [16] for 
the Kitaev surface-codes, realizes the encoded gate set 
we described using only gates between spatially neigh- 
boring qubits. One difference in our analysis from that 
performed in Ref. [16] is that we have analyzed the accu- 
racy threshold not only for fault-tolerant quantum mem- 
ory but also for fault-tolerant quantum computation, a 
feat made tractable by the strong CSS nature of the color 
codes. 



B. Fault-tolerance by code deformation 

The method of fault-tolerance described in Sec. VIA 
requires a three-dimensional architecture to allow the 
transversal CNOT gates to remain spatially local. This 
violates the spirit of using two-dimensional codes in the 
first place. Fortunately, it is possible to use code de- 
formation to achieve fault-tolerance in a strictly two- 
dimensional architecture. Our construction here mirrors 
that of Raussendorf et aVs construction for surface codes 
[17, 67]. Fowler has independently constructed a method 
for using code deformation in 4.8.8 color codes that is 
similar to ours [49] . Some salient differences between our 
method and Fowler's are that (i) Fowler's logical qubits 
are always encoded in a triple of defects whereas ours are 
encoded in single defects except during certain logical 
gates, and (ii) Fowler's scheme disallows different defect 
types from occupying the same plaquette location while 
ours does not. Each of these differences allows our scheme 
to encode a higher density of information. Specifically, 
our scheme allows a six-fold increase in logical qubit den- 
sity over the Fowler scheme. 

To begin, we generate a sufficiently large 4.8.8 trian- 
gular color code by performing fault-tolerant quantum 
error correction on a collection of qubits. We are not 
interested in what state the triangular code encodes — 
all we require is that the state is in the codespace with 
arbitrarily high fidelity. We consider any logical qubits 
associated with the entire surface to be "gauge" qubits 
in the language of subsystem stabilizer code theory [68] . 
We will use this state as a substrate for generating and 
manipulating encoded qubits. 

Each element of the standard set of stabilizer genera- 
tors for a color code can be labeled by a face of a definite 
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color (red, green, or blue) and an operator of a definite 
Pauli type (X or Z). Notationally, we will refer to a gen- 
erator as a (c, P) generator if it is of color c and Pauli type 
P. To prepare an encoded qubit in our color code sub- 
strate, we remove a connected product of stabilizer gen- 
erators of the same color and type. (Generally removal 
of any element of the stabilizer group will yield a logical 
qubit; we restrict attention to this class for simplicity.) 
We call this removed region a defect in analogy with the 
language used by Raussendorf et al. in Ref. [17]. This 
removal is entirely passive — we simply cease measuring 
this product of stabilizer generators in future quantum 
error correction rounds. For this reason, it is manifestly 
a fault-tolerant process. 

In the following sections, we describe how to perform 
a universal repertoire of encoded logic gates on defect- 
based logical qubits, with arbitrarily high fidelity. This is 
therefore a prescription for fault-tolerant quantum com- 
putation using code deformation. 



1. Preparing a defect in 0) or +) 

In principle, the generator removed to form a defect 
qubit can be identified with any element of the encoded 
Pauli group for that encoded qubit. For concreteness, 
we make the choice of calling the removed generator a 
logical Z when it is Z-type defect (also called a 'primal' or 
'smooth' defect in the language of Ref. [17]) and a logical 
X when it is X-type defect (also called a 'dual' or 'rough' 
defect in the language of Ref. [17]). Thus removing a c- 
colored X- or Z-type generator corresponds to preparing 
a logical |+)( c ,x) or |0)(c,z) state respectively 

The logical Z operator for a (c, X) defect acts as Z on a 
c-colored chain of qubits connecting the defect to another 
c-colored boundary, which may itself be another defect. 
If no such other boundary exists, then the defect fails 
to encode a logical qubit. To avoid this complication, 
we have chosen our substrate to be a triangular code, 
having boundaries of each of the three colors. Similarly, 
the logical X operator for a (c, Z) defect acts as X on 
a c-colored chain of qubits connecting the defect to a c- 
colored boundary. 

Preparing a |+)( c ,z) or |0)( c .x) state requires more 
care. To do this, we measure M x or M z respectively 
along a c-colored chain of qubits from the plaquette we 
wish to store the logical qubit in and the nearest c-colored 
boundary. This projects each qubit along the chain into 
either |+) or |— ) (resp. |0) or |1)), which we can interpret 
as |+) (resp. |0)) for each qubit by changing local Pauli 
bases. We then measure the Z-checks (resp. AT-checks) 
incident on this chain except the one at the defect loca- 
tion and correct any errors, which places the defect back 
into the substrate in the desired state. 

An arbitrarily large c-colored defect can be prepared 
in a single step by ceasing to measure a collection of 
c-connected defects by a similar process, enabling the 
preparation process to be made arbitrarily reliable. This 



introduces a number of "gauge" qubits in the interior of 
the defect that can be ignored; the details of this are 
described in the next section. 



2. Growing, shrinking, and moving defects 

We grow a (c, P) defect qubit on region q in the fol- 
lowing way. Suppose we would like to extend the defect 
so that it includes an adjacent region q' of the same color 
and type. (By adjacent, we mean that the regions can 
be connected by a single two-qubit c-colored link.) To 
do this, we first perform the following conditional opera- 
tion. If P = X , then we measure ZZ on a c-colored link 
connecting the regions, while if P — Z, then we measure 
XX on a c-colored link connecting the regions. Exam- 
ples of how this works for octagonal and square defects 
are depicted in Figs. 27 and 28; the circuit in Fig. 29 
implements this transformation. A YY operator can be 
used to grow a X and Z-type defect at the same time. 

(a) Octagonal green Z defect. (b) Growth to two defects. 



FIG. 27: Growth of an octagonal green Z defect by one site. 



(a) Square red Z defect. (b) Growth to two defects. 



FIG. 28: Growth of a square red Z defect by one site. 
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FIG. 29: Measurement of XX to grow a Z-type defect. The 
measurement can be performed with existing circuitry already 
in place for syndrome extraction. 

After this measurement, the new collective defect op- 
erator is the product of the q and q' defect operators. 
The ±XX or ±ZZ operator has also been added to the 
list of stabilizer generators. As usual, we do not need to 
actually correct the result to a +1 outcome: it suffices 
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to update the Pauli frames of the stabilizer generators 
incident on these two interior qubits. 

Because we will no longer use the weight-two opera- 
tor, we may consider it to also be a "gauge" operator 
in the language of subsystem stabilizer codes [68]. This 
also makes its anticommuting partner a gauge operator, 
which we may interpret to be either of the original de- 
fect operators (on q or q'). By introducing these two 
new gauge operators, we may reinterpret the defect log- 
ical operator on the collective q and q' region as acting 
solely on its boundary. In particular, the interior of the 
collective q and q' region need never be involved in future 
syndrome extractions. 

An important question is whether the defect growth 
process is fault-tolerant. The simplest circuit for mea- 
suring XX or ZZ would perform CNOT gates into or 
out of an ancilla qubit to each of the two relevant qubits, 
as depicted in Fig. 29. Although a single error in this 
ancilla qubit could propagate to two errors on the two in- 
terior qubits, because we subsequently treat these qubits 
as encoding a gauge qubit, we do not worry about errors 
on these. It could still be the case that the value of the 
measurement obtained is incorrect, which impacts the 
update of the Pauli frame of the two adjacent stabilizer 
generators in a correlated way. Thus a single syndrome 
measurement error would propagate to two syndrome-bit 
errors. To prevent this happening to first order in the er- 
ror probability, we repeat the XX or ZZ measurement 
twice and use the majority vote of the three outcomes to 
update the Pauli frame. 

Compared to the process of defect growth, defect con- 
traction is much simpler: to shrink a defect by a single- 
plaquette, one simply measures that plaquette operator 
in the next round of fault-tolerant quantum error correc- 
tion. 

By a combination of local growth and shrinking pro- 
cesses, one can deform the code with a (c, P) defect at 
one plaquette to a code with a (c, P) defect anywhere 
else. In other words, the move operation for a defect can 
be decomposed into a sequence of more elementary grow 
and shrink operations. 



3. Measuring a defect 

To destructively measure the logical operator encir- 
cling a defect, one first shrinks the defect to size of a 
single plaquette. Then one measures the defect with the 
existing circuitry at that plaquette as though it were a lo- 
cal stabilizer generator. The shrunken defect will have a 
significantly lower tolerance to one type of Pauli error but 
that error type is in the basis being measured in and will 
not disturb the measurement outcome. To destructively 
measure the string-like logical operator connecting two 
defects, one brings the two operators as close together 
as possible. One then measures the weight-two operator 
connecting the defects using the circuitry used to grow a 
defect from one site to encompass the other. Again, the 



tolerance to errors of one Pauli type will be significantly 
lower, but this will not be of the type that disturbs the 
measurement. 

To nondestructively measure a defect, one uses the cir- 
cuit of Fig. 23, which uses destructive measurement of 
M z or Mx, preparation of |0) or |+), and the CNOT 
gate described in the next section. 



4- CNOT gate between defects 

It is straightforward to show that moving a (c, Z) de- 
fect qubit around a {c',X) defect qubit (or vice- versa) 
generates an encoded CNOT gate controlled by the (c, Z) 
defect when c and d are different colors; the construc- 
tion is essentially the same as that in Refs. [17, 49, 67]. 
Since this process traces out a braid in spacetime, we 
call this process "braiding defects." Also drawing upon 
Refs. [17, 67], one can generate a CNOT gate between 
two Z-type defects or two X-type defects, whether they 
are the same color or not. The circuit for doing this be- 
tween two Z-type defects is depicted in Fig. 30; the cir- 
cuit for doing this between two X-type defects is similar. 
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FIG. 30: Circuit for braiding a CNOT gate between Z-type 
defects. The colors c and c' may be the same or different, 
but the color c" is a color different from these. The circuit 
for braiding a CNOT gate between X-type defects is simi- 
lar: the CNOT gate directions are reversed, the types of the 
defects and the types of the measurements have their Pauli 
types swapped from X to Z and vice- versa, and the |0) state 
becomes a |+) state and vice- versa. 

One can convert an X-type defect into a Z-type defect, 
or vice-versa, (changing its color as a side effect) using 
one of the circuits in Fig. 31. In conjunction with the 
other type of CNOT gates mentioned, this allows CNOT 
gates between two defects regardless of the colors or Pauli 
types they have. 
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FIG. 31: Circuits for converting a Z-type defect into an X- 
type defect and vice-versa. 
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5. Phase gate on a defect 

To perform an S (phase) gate on a (c, P) defect, we 
prepare two more qubits of (c', P) and (c", P) type, each 
in the state |0) and use CNOT gates to put the defect 
into a three-defect repetition code. This maps our single- 
defect logical qubits into the three-defect logical qubits 
Fowler uses in his construction [49]. We then grow the 
defects and connect them so that they separate an in- 
terior triangular region from an exterior region, just as 
described in Fowler's construction. If P = Z, then as 
Fowler noted, it suffices to apply S transversally (actu- 
ally, S 1 ^ must be applied transversally) to generate a logi- 
cal S on the triple-defect qubit. However, if P = X, then 
Fowler's construction fails, because the "exterior trees" 
in his language fail to undergo the action SXS 1 * = Y. 
"Pruning" the exterior tree as Fowler suggests for his im- 
plementation of the Hadamard gate fails as well, because 
such an operation yields only the "byproduct operator" 
for logical X or Z on the triple-defect qubit, but not both. 
To perform the S gate on X-type defects, we propose the 
following two-step procedure. First, we arrange the de- 
fects to separate a triangular interior from the exterior 
and apply transversally on the interior. Second, we re- 
arrange the defects so that part of what was the exterior 
becomes the new interior, and perform transversally 
on this new triangular interior. In this way, both the in- 
terior and exterior trees experience the S gate. Once the 
gate is complete, we run the three-defect encoding circuit 
in reverse and absorb the two ancilla qubit regions back 
into the substrate in subsequent quantum error correc- 
tion rounds. 

Of course, the S gate can also be achieved via magic 
states of the form |vr/2) := ^= (|0) +e" r / 2 |l)) (also called 
\Y) or | +i) in the literature) in a manner similar to what 
is done for surface codes. But this is one of the great 
benefits of 4.8.8 color codes — no magic state distillation 
and usage is required to realize this gate in encoded form. 
It may well be worth the lower accuracy threshold of 
color codes relative to surface codes in order to achieve 
the resource reduction for performing encoded S gates. 



6. Hadamard gate on a defect 

From one point of view, a logical Hadamard gate is un- 
necessary because it can be implemented using the gates 
we have previously described, for example by the circuit 
of Fig. 32. However, we have developed a more resource- 
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FIG. 32: Circuit for simulating H with previously-described 
gates. 

efficient way to perform this gate that we describe here. 



If we want to perform a Hadamard gate on a (c, P) 
defect, we first prepare an ancilliary (c, P) defect in the 
state |0) and perform a CNOT gate from the defect qubit 
to this defect ancilla using the circuit of Fig. 30. This en- 
codes the original defect qubit into the two-qubit bit-flip 
repetition code across the two defects. The ZZ opera- 
tor for the two defect qubits is in the stabilizer group 
of this repetition code, so we can measure ZZ without 
disturbing the encoded qubit. The logical Z operator is 
a c'-colored chain of Z operators around either of the de- 
fects and the logical X operator is a c-colored chain of 
X operators connecting the defects, where d ^ c. This 
encoding is the one used at all times in the Raussendorf 
et al. scheme [17, 67], but here we only use it to perform 
the Hadamard gate and go back to our original single- 
defect encoding once the gate is completed. 

After we've encoded the defect qubit into two, we then 
perform individual Mz measurements on a c'-colored 
chain of qubits surrounding both defects, where c' 7^ c. 
This separates the region of the two qubits from the sub- 
strate, so we can then apply H transversally on the cut- 
out region without influencing the external substrate. 
This operation applies a logical Hadamard gate to the 
two defect qubits in the interior, but also turns them 
into P'-type defect qubits in the process, where P' is 
conjugate to P {i.e., P' = Z if P = X, and P' = X if 
P = Z). We then stitch the cut out region back into the 
code by measuring the P-check operators incident on the 
cut. The encoding circuit for the repetition code is run 
in reverse, and the resulting defect can be converted back 
to its original type and color using circuits of the form 
depicted in Fig. 31. 



7. Injecting |7r/4) into a defect 

To perform universal encoded quantum computation 
with defects, our approach requires defects encoded into 
the state \tt/4) with an error below its distillation thresh- 
old, as discussed in the previous "pancake" architecture. 
We therefore need a method for injecting magic states 
into defects such that the injection process introduces 
errors at a rate below the distillation threshold. The 
single-qubit preparation threshold for a magic state is 
therefore the difference between its distillation threshold 
and the error introduced by its injection process. 

It is worth remarking that this kind of injection pro- 
cess is used in defect-based surface code schemes as well 
[17, 49, 69]. In these schemes, one must not only in- 
ject |7r/4) states, but also inject \ir/2) states as well. 
However, the impact of errors introduced by errant in- 
jection has not been studied to our knowledge. It is un- 
clear whether considering it will significantly alter the 
high threshold values numerically estimated for surface 
codes — the difference between a 1% accuracy threshold 
and a 14% distillation threshold is not that great, so it 
is reasonable to expect that it may be quite important, 
especially because injection generates small-sized defects 



24 



that are not arbitrarily well- protected from noise at first. 
We do not investigate the impact of the injection process 
on the threshold for color codes here either, but we ex- 
pect that it will be less consequential because the value 
of the color-code accuracy threshold is much lower than 
that for the surface codes. 

To inject into a (c, Z) defect, we identify the corner of 
the triangular substrate containing the c-colored plaque- 
tte and measure M z on the qubit in the corner, isolating 
it from the code. We then apply TH to the corner qubit 
and then measure the weight-four X check in the corner, 
bringing the corner qubit back into the code. We then 
cease measuring the Z check in the standard way, creat- 
ing a single-plaquette Z-type defect in the corner. This 
defect is not well protected from noise, so we move it 
from the corner and grow it as fast as we can, so that the 
ambient noise doesn't degrade the fidelity of the encoded 
state. 



8. T gate on a defect 

Given \n/4) defect qubits, we can distill them and use 
them to perform the T gate in the same way as described 
in Sees. VI A 7 and VI A 8 for the "pancake" architecture. 

VII. CONCLUSIONS 
A. Fault-tolerant quantum computation 

We studied fault-tolerant quantum computation using 
color codes, inspired by (a) the need to minimize qubit 
transport in real technologies having 2D layouts and (6) 
the high accuracy thresholds reported for similar topo- 
logical codes. We framed our study with a well-defined 
quantum control model and three physically-motivated 
noise models of increasing realism which we call the code- 
capacity noise model, the phenomenological noise model, 
and the circuit-based noise model. 

The strategy behind our study was to first understand 
how to fault-tolerantly simulate the identity gate via 
fault-tolerant quantum error correction and then extend 
this understanding to how to fault-tolerantly simulate a 
universal set of quantum gates capable of general-purpose 
quantum computation. 

In the course of studying fault-tolerant quantum er- 
ror correction, we formulated most-likely-error decoding 
for color codes as a mathematical optimization problem 
known as an integer program. We also developed fea- 
sible schedules for parallelized syndrome extraction for 
the most efficient family of color codes, the 4.8.8 color 
codes. To better understand the performance of our 
decoder, we elaborated a previously-established connec- 
tion between the performance of our decoder and some 
statistical-mechanical classical spin models. 

Our numerically-estimated value for most-likely-error 
fault-tolerant quantum error correction for 4.8.8 color 



codes in the code-capacity noise model is 10.56(1)%. This 
is not significantly different from what had previously 
been estimated for optimal decoding of these and the 
6.6.6 color codes, or most-likely-error or optimal decod- 
ing of Kitaev's 4.4.4.4 surface codes. Indeed, the upper 
bound for any CSS code is slightly more than 11%, so 
all of these codes perform close to optimally in this noise 
model. To support our numerical estimate, we proved 
that the threshold is at least 8.335 745 (1)% using a self- 
avoiding walk technique. 

Our numerically-estimated value for the accuracy 
threshold of most-likely-error fault-tolerant quantum er- 
ror correction for 4.8.8 color codes in the phenomeno- 
logical noise model is 3.05(4)%. Again, this is not sig- 
nificantly different from what had previously been esti- 
mated for optimal decoding of the 6.6.6 color codes, or 
most-likely-error or optimal decoding of Kitaev's 4.4.4.4 
surface codes. We attribute the nominal improvement we 
find relative to Kitaev's surface codes for both this and 
the previous noise model to the fact that the color codes 
have higher-weight stabilizer generators, which should 
be modeled as more errant, but which aren't in these 
noise models. To support our numerical estimate, we 
proved that the threshold is at least 0.3096% using a 
self-avoiding walk technique. 

Our numerically-estimated value for most-likely-error 
fault-tolerant quantum error correction for 4.8.8 color 
codes in the circuit-based noise model is 0.082(3)%. By 
attempting to optimize the syndrome extraction circuit 
by hand, we ended up surprisingly decreasing our thresh- 
old estimate to 0.080(3)%, suggesting that optimizing the 
syndrome extraction circuit to find the highest threshold 
is a nontrivial task. Unlike our findings for the previ- 
ous two noise models, our accuracy-threshold estimate is 
in fact significantly different from what had previously 
been estimated for most-likely-error decoding of Kitaev's 
4.4.4.4 surface codes — it is nearly a tenth the comparable 
value of 0.68%. That said, it is consistent with the value 
of "about 0.1%" estimated using a different sub-optimal 
decoder for these codes considered in Ref. [22]. However, 
the estimate in Ref. [22] lacked any error analysis, so it is 
hard to determine how consistent these results truly are. 
We believe that the reduction in threshold relative to the 
surface code threshold comes from the increased weight 
of the stabilizer generators for the 4.8.8 color code. Based 
on this, we predict that the 6.6.6 color codes will have 
a quantum error-correction accuracy threshold for this 
noise model somewhere between 0.082(3)% and 0.68% 
without any additional optimizations. We did not prove 
a lower bound on the threshold in this noise model, as the 
self-avoiding walk technique breaks down for this noise 
model. 

To extend our results to general-purpose fault-tolerant 
quantum computing, we considered two different ap- 
proaches. In the first, the architecture consisted of 2D 
surfaces stacked like pancakes in which each surface cor- 
responded to a logical qubit and almost all operations 
were either global transversal operations or local syn- 
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drome extraction operations. In the second, the archi- 
tecture consisted of an extended 2D surface in which log- 
ical qubits were associated with "defects" and almost all 
operations were either defect braiding by local measure- 
ments or local syndrome extraction operations. 

In the "pancake" architecture, we showed that en- 
coded universal quantum computation was possible us- 
ing only local stabilizer measurements, global transversal 
operations, and the time-reversed coherent encoding cir- 
cuit for the color code, which was used to inject magic 
states. Each gate in this architecture has its own accu- 
racy threshold that is a significant fraction of the quan- 
tum error correction (memory) threshold. 

In the "defect" architecture, we showed that encoded 
universal quantum computation was possible using only 
local stabilizer measurements, code deformation, and 
transversal operations on isolated regions. These de- 
formations came in different forms, including growing 
small defects into large ones, braiding defects around 
each other for encoded CNOT gates, and isolating defects 
from the rest of the code. Each gate has the same accu- 
racy threshold as the quantum error correction (memory) 
threshold, although errors afflicting injected magic state 
defects before they are grown to full size may dominate 
the threshold for the less realistic code-capacity and phe- 
nomenological noise models. 

Because the defect architecture has a higher threshold 
and is more consistent with the original motivation for 
our study — namely that many technologies are restricted 
to a single 2D layout — we believe the defect-based ap- 
proach to be the most practical. To that end, we ex- 
tended some of the defect-based approach for color codes 
presented in Ref. [49] so that a significantly higher den- 
sity of defects can be stored and processed in the surface. 



B. Relation to statistical-mechanical phase 
transitions 

It has been previously established that there is a 
mapping between quantum color codes and a classical 
statistical-mechanical model known as the three-body 
random-bond Ising model (3BRBIM). In this mapping, 
each check maps to a classical ±1 spin and each qubit 
maps to a three-body interaction, with the interaction 
being ferromagnetic if the qubit is not in error and anti- 
ferromagnetic if it is. Specifically, the Hamiltonian con- 
structed by this mapping is 

h = e j i n 5 - ( ?4 ) 

qubits q checks cBq 

where J q € ±1 indicates a flip on qubit q and S c € ±1 
indicates the eigenvalue of the check c. 

A feature of the mapping is that the code capacity for 
any particular decoding algorithm represents a point on 
the boundary of the order-disorder transition of the as- 
sociated 3BRBIM. Our integer-programming decoder is 



an "energy-minimizing" decoder in this paradigm, cor- 
responding to the phase boundary at zero temperature. 
Because our code-capacity value of 10.56(1)% is lower 
than the code capacity of 10.925(5)% of a "free-energy- 
minimizing" decoder implicitly explored by Ohzeki [25], 
this demonstrates that the phase boundary of the 3BR- 
BIM is "re-entrant" as depicted in Fig. 11, violating the 
so-called Nishimori conjecture for this system. This re- 
sult is counterintuitive because it states that the 3BR- 
BIM can become more ordered by increasing the tem- 
perature, depending on the system's quenched disorder 
parameter. It would be exciting to see experimental con- 
firmation of this effect. 



C. Future directions 

While we have been able to answer many questions 
about fault-tolerant quantum computing using color 
codes, practicalities have necessarily limited the focus of 
our analysis, leaving other related questions open. Our 
results also also raise new questions that we believe are 
worthy of study. 

One future direction we mentioned is optimizing the 
syndrome extraction circuit. One could also examine us- 
ing more elaborate ancilla states in the circuit, such as 
those used in the schemes proposed by Shor [1], Steane 
[46], and Knill [10]. In any scheme one chooses, further 
improvement may still be possible by transforming the 
circuit used in an implementation. 

Another future direction we alluded to is optimizing 
the decoding algorithm. One could examine the per- 
formance of the truly optimal decoder for the circuit 
model which accounts for the correlations in the noise 
induced by the syndrome extraction circuit. This will 
yield an upper bound on the accuracy threshold for the 
noise modcl(s) studied. On the other end of the spec- 
trum, it would be useful to explore the performance of 
faster decoders which don't yield as high a threshold as 
the MLE decoder but which may be more valuable in 
practice. The renormalization group decoder [70] and 
minimum-weight perfect matching decoder [16] (using a 
mapping of one color code to two Kitaev surface codes 
[24]) are examples of this. Another alternative is to gen- 
eralize the results of Feldman et ai, who developed an 
efficient linear-program decoder for binary codes based 
on an integer-program-based decoder similar to the one 
we developed here [71]. 

The lower bound technique of self- avoiding walks that 
we used is certainly not the tightest, and it may be of 
interest to establish tighter lower bounds. For tighter 
bounds, it may be possible to use different techniques. 
In the case of the circuit-based noise model, the self- 
avoiding walk bound technique breaks down dramati- 
cally, and it would be worth exploring other lower-bound 
techniques in this setting. 

While we believe the noise and control model that we 
studied is reasonable, it is certainly not unique and can 
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be improved upon with more experimental input. As 
shown by Levy et ai, [34, 35], when more realistic models 
are included, conclusions regarding fault tolerance can 
change dramatically. Even at an abstract level, one could 
modify our depolarizing noise model for CNOT gates so 
that it acted ideally with probability 1—p and applied one 
of the fifteen nontrivial Pauli operators with probability 
p/15 each rather than acting ideally with probability 1 — 
p and applying one of the sixteen Pauli operators with 
probability p/16. 

While we gave a prescription for injecting magic states 
into the color code for both the pancake and 2D defect- 
based architectures, we did not carefully study the 
threshold of the circuits used for injection. To our knowl- 
edge, this type of study has not been performed for Ki- 
taev's surface codes either. Such studies would be valu- 
able, as it could be the case that the magic state prepara- 
tion threshold is actually less than the accuracy threshold 
reported for all of the other gates, even though the dis- 
tillation threshold for the magic states is higher than the 
accuracy threshold for the other gates. 

Finally, the connection between color codes and the 
three-body random-bond Ising model allowed us to ex- 
plore the structure of order-disorder transition in the lat- 
ter model by studying the former. This is one of the 
rare examples where a purely quantum information the- 
oretic result has led to greater understanding of a clas- 
sical system. Kitaev's surface codes and the two-body 
random-bond Ising model have a similar connection and 
have admitted a similar study [16, 27]. It is clear that it is 
the CSS structure of these codes that admits these stud- 
ies; one could argue that every CSS code is a topological 



code for some topology, having an associated classical 
statistical-mechanical model for a given quantum noise 
model. It might be interesting to use the fault-tolerant 
decoding of CSS codes generally as a tool to explore re- 
lated statistical-mechanical systems with quenched dis- 
order. 
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